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GENES ENCODING ENZYMES IN THE BIOSYNTHESIS OF PIMARICIN AND 

THE APPLICATION THEREOF 

Field of the inventiiQn 
5 The invention relates to novel genes encoding 

enzymes which are fundamental' in the biosynthesis of 
pimaricin. The invention further relates the application of 
said gene for modifying the biosynthesis of pimaricin. It 
also relates to the biosynthesis of new compounds . 

10 

Background of the inventloTi 
Polyketides, such as pimaricin (in the literature 
also referred to as natamycin, see for its structure Fig. 
3A) , form a large and highly diverse group of natural 

15 products. Members of the said group include compounds having 
antibacterial, antifungal, anticancer, antiparasitic and 
immunosuppressant activities . 

Despite their structural diversity, these 
metabolites are believed to be synthesized by micro- 

2 0 organisms by a common pathway in which units derived from 

acetate, propionate or butyrate are condensed onto a growing 
chain by a polyketide synthase (PKS) . The process resembles 
fatty acid biosynthesis, except that the p-keto function 
introduced at each elongation step may undergo all, part or 

25 none of a reductive cycle comprising P-ketoreduction, 
dehydration and enoylreduction . Structural variety of 
polyketides arises from the choice of monomers, the extent 
of p-ketoreduction and dehydration, and the stereochemistry 
of each chiral center. Yet further diversity is produced by 

30 functionalization of the polyketide chain by the action of 
glycosylases, methyltransf erases and oxidative enzymes. 

Modification of complex biomolecules , such as 
polyketides, is increasingly an important way of obtaining 
biologically active compounds with improved or altered 

35 properties. Currently, these modifications are usually 

introduced by chemical methods in a directed or random (e.g. 
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in combinatorial chemistry) manner. A drawback of these 
chemical methods is that they are often performed under 
relatively harsh conditions and furthermore, they lack 
selectivity and/or sensitivity. Particularly, in the case of 
5 complex biomolecules having multiple f unctionalized, 

reactive groups, precautions have to be taken in order to 
avoid undesired side reactions. These precautions include 
for instance the introduction of protective groups before a 
desired chemical conversion. Consequently two additional 

10 process steps are involved, as the protective groups must be 
removed afterwards . 

Bioconversion of simple organic compounds, i.e. 
compounds with no or single reactive centers, has been known 
for some time and has been widely applied. Examples are the 

15 oxidation of long chain alkanes using alkane hydroxylation 
systems of Pseudowonss , and epoxidation of alkenes using 
enzyme systems from various micro-organisms. However, for 
the specific modifications required in the biosynthesis of 
complex molecules, for example, P- lactam antibiotics, 

20 polyketide antibiotics, anticancer agents, or peptide 

antibiotics, the large amounts of reactive groups present in 
those molecules are problematic for even the simplest 
treatments, such as hydrolysis of specific bonds. More 
complicated treatments frequently completely destroy the 

25 molecule. 

The present invention is based on the identification 
and isolation of three genes which encode enzymes which 

3 0 facilitate specific oxidative conversions in the 

biosynthesis of pimaricin. The present invention thus 
provides the means to perform specific conversions in 
complex biomolecules, in particular in polyketides, without 
applying the harsh conditions often related to chemical 

35 modifications. The said conversions can be carried as part 
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of a biosynthesis of said biomolecules , for instance in 
micro-organisms . 

Surprisingly, it has been found that the expression 
of polynucleotides of the invention in different micro- 
organisms, can lead to the biosynthesis of different 
biomolecules . It has further been found that expression of 
the said polynucleotides may be switched off (or knocked 
out) in Streptomyces which is usually used for the 
biosynthesis of pimaricin. In this embodiment, no pimaricin 
is produced by said Streptomyces , but instead a modified 
biomolecule is produced. In addition, it has been found that 
the polynucleotides may be overexpressed in Streptomyces , 
leading to an increase in the biosynthesis of pimaricin in 
the said Streptomyces . 

According to the invention there is thus provided a 
polynucleotide comprising: 

i) a nucleic acid sequence set out in SEQ ID NO: 5, 7 or 9 
or a sequence complementary thereto; or 

ii) a homologue or fragment of a sequence defined in i) . 
The invention also provides: 

a polypeptide encoded by a polynucleotide of the 
invention which is preferably isolated and/or purified; 
a polypeptide obtainable by a polynucleotide of the 
invention in a cell which is a Streptomyces (including 
e.g. S .natal ens is) cell or a cell of a heterologous 
species 

a polypeptide comprising the amino acid sec[uence set out 
in SEQ ID NO: 6, 8 or 9 or a homologue or fragment 
thereof ; 

a recombinant cell comprising at least one additional 
copy of a polynucleotide of the invention, wherein the 
cell naturally possesses at least one said 
polynucleotide ; 

a recombinant cell, wherein a polynucleotide of the 
invention which naturally occurs in the cell has been 
inactivated; 
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a recombinant cell comprising a polynucleotide according 
to the invention which polynucleotide does not naturally 
occur in that cell or where the polynucleotide is 
heterologous to that cell; 

a method for overexpressing a polynucleotide encoding a 
polypeptide according to the invention in Streptowyces 
cell which method comprises: 

i) attaching a promoter sequence to the said 

polynucleotide ; 
ii) transferring the resulting promoter-polynucleotide 

complex into the said cell; and 
iii) maintaining the resulting cell under conditions 

suitable for expression of the said polynucleotide; 
a method for inactivating a polynucleotide encoding a 
polypeptide according to the invention in a Streptowyces 
cell which method comprises disrupting the coding 
sequence of the said polynucleotide; 

a method for expressing a polynucleotide encoding a 
polypeptide according to the invention in a heterologous 
cell which method comprises : 

i) attaching a promoter sequence to the said 
polynucleotide ; 

ii) transferring the resulting promoter-polynucleotide 
complex into the said cell; and 

iii) maintaining the resulting cell under conditions 
suitable for expression of the said polynucleotide; 

a method for producing pimaricin which method comprises 
maintaining a recombinant cell according to the invention 
under conditions suitable for obtaining expression of the 
additional copy of a polynucleotide according to the 
invention and isolating the said pimaricin; 
a method for producing a biomolecule which method 
comprises maintaining a recombinant cell according to the 
invention under conditions which would be suitable for 
obtaining expression of the inactivated polynucleotide 
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had it not been inactivated and isolating the said 
biomolecule ; 

a method for producing a biomolecule which method 
comprises maintaining a recombinant cell according to the 
5 invention under conditions suitable for obtaining 

expression of the polynucleotide which does not naturally 
occur in the cell and isolating the said biomolecule; 
a biomolecule obtainable by a method of the invention for 
producing a biomolecule; 
10 - use of a recombinant cell of the invention in the 
production of pimaricin; 

use of a recombinant cell of the invention in the 
production of a biomolecule; 

a vector containing a polynucleotide of the invention 
15 which is capable of expressing a polypeptide of the 

invention; 

a cell harbouring a vector of the invention; and 
- a method for producing a polypeptide of the invention, 
which method comprises maintaining a recombinant cell 
2 0 according to the invention under conditions suitable for 

obtaining expression of the polypeptide and isolating the 
said polypeptide. 

use of a isolated and/or purified polypeptide according 
to the invention for the oxidative modification of a 
25 methyl group of a suitable compound. 

Brief description of the drawings 
Figure 1 : Physical map of part of the Pimaricin 
biosynthetic cluster . 
30 Genes: locations of the genes encoding 

polyketide synthases and oxidative genes involved in 
Pimaricin production (not drawn to scale) ; 

Probes: 0.7 indicates the location of the 0.7 
kb fragment used to identify the extent of polyketide 
35 synthase encoding regions; 3.3 indicates the location of the 
3.3 kb fragment used in polyketide synthase gene disruption; 
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Cosmids : sizes and numbers of available 
cosmids covering the chromosomal region encompassing the 
oxidative genes. 

Figure 2: Detailed physical map of the chromosomal 
regions including the oxidative genes. 



Figure 3B: Molecular structures of Pimaricin 
derivatives with a reduced oxidation state of C4 and C5 
and/or the carboxyl group at C12 . 

Figure 4 : Molecular structures of Amphotericin B and 

Nystatin 

Figure 5: 5 illustrates the conversion of the 
triketide lactone to it oxidized form by the action of pORFl 
and pORF2 



SEQ ID 1 shows the nucleotide sequence and derived amino 
acid sequence of a first Pimaricin biosynthesis associated 
polyketide synthase gene 

SEQ ID 2 shows the amino acid sequence of a first Pimaricin 
biosynthesis associated polyketide synthase 
SEQ ID 3 shows the nucleotide sequence and derived amino 
acid sequence of a second Pimaricin biosynthesis associated 
polyketide synthase gene 

SEQ ID 4 shows the amino acid sequence of a second Pimaricin 
biosynthesis associated polyketide synthase 
SEQ ID 5 shows the nucleotide sequence and derived amino 
acid sequence of ORFl, an oxidative gene involved in 
Pimaricin biosynthesis 

SEQ ID 6 shows the amino acid sequence of an oxidation 

enzyme pORFl involved in Pimaricin biosynthesis 

SEQ ID 7 shows the nucleotide sequence and derived amino 



Figure 3A: Molecular structure of Pimaricin. 



Description of the sequence listings 
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acid sequence of ORF2 , an oxidative gene involved in 
Pimaricin biosynthesis 

SEQ ID 8 shows the amino acid sequence of an oxidation 
enzyme pORF2 involved in Pimaricin biosynthesis 
5 SEQ ID 9 shows the nucleotide sequence and derived amino 
acid sequence of 0RF3 , an oxidative gene involved in 
Pimaricin biosynthesis 

SEQ ID 10 shows the amino acid sequence of an oxidation 
enzyme pORF3 involved in Pimaricin biosynthesis 
10 SEQ ID 11 shows a synthetic oligonucleotide (forward primer) 
for isolation by PGR of the ermE promoter of 
Sa ccharopolyspora erythraea 

SEQ ID 12 shows a synthetic oligonucleotide (reverse primer) 
for isolation by PGR of the ermE promoter of 
15 Saccharopolyspora erythraea 

SEQ ID 13 shows a synthetic oligonucleotide (forward primer) 
for isolation by PGR of the N-terminal region of ORFl 
SEQ ID 14 shows a synthetic oligonucleotide (reverse primer) 
for isolation by PGR of the N-terminal region of ORFl 

20 

Detailed description of the invention 
Three open-reading frames (ORFs) were identified 
from the Pimaricin producing microorganism Streptomyces 
natalensis . The three ORFs are associated with polyketide 
25 synthese genes and each ORF has been shown to be essential 
for pimaricin biosynthesis. 

The functionality of the Pimaricin PKS associated 
genes was initially pursued by comparing their derived amino 
acid sequences with those present in public databases like 
3 0 EMBL, Genbank, NBRF/PIR, or Swissprot . 

Surprisingly, ORFl appeared to resemble cholesterol 
oxidases from several Streptomyces species . The close 
association of ORFl with the Pimaricin PKS suggests an 
oxidative step in Pimaricin tailoring. A methyloxidase 
35 encoding gene has not been observed previously in a 
polyketide biosynthesis gene cluster. 
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Based on similar analyses, 0RF2 and ORF3 resemble 
cytochrome P4 5 0 dependent monooxygenases from various 
sources. With respect to the biosynthesis of bioactive 
compounds, P450 dependent monooxygenases have been 
5 identified before in association with polyketide gene 
clusters, e.g. in the Erythromycin and Rapamycin 
biosynthesis gene clusters. However, only in the 
Erythromycin case has a specific enzymatic action on 
Erythromycin precursor compounds been proven. Essentially 

10 all known cases of tailoring oxidation steps act on 

secondary carbon atoms (methylene groups) . Oxidation of 
primary carbon. atoms (methy] groups) to carboxylic acid 
function in polyketide biosynthesis, as has presently been 
found, is unprecedented. Nothing is known about the 

15 molecular basis of epoxide formation in polyketide products, 
though epoxides are present in a few known structures - 

Thus, the invention provides a polynucleotide which 
comprises : 

20 i) a nucleic acid sequence set out in SEQ ID NO: 5, 7, or 

9 or a sequence complementary thereto; or 
ii) a homologue or fragment of a sequence defined in i) . 

Polynucleotides of the invention may comprise DNA or 
RNA. The invention also provides double stranded 

25 polynucleotides comprising a polynucleotide of the invention 
and its complement . 

Homologues of a nucleic acid sequence set out in SEQ 
ID NO: 5, 7 or 9 are polynucelotideds which do not share 
100% sequence identity with a sequence set out in SEQ ID NO: 

3 0 5, 7, or 9, but which do encode polypeptides having a 
similar enzyme activity to a polypeptide encoded by a 
nucleic acid sequence set out in SEQ ID NO: 5, 7 or 9 . Thus 
a homolog of a polypeptide encoded by SEQ ID NO: 5 will 
typically encode a polypeptide which has methyl oxidase or 

35 methyloxidase-like activity. A homologue of a polypeptide 
encoded by SEQ ID NO: 7 or 9 will typically encode a 
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polynucleotide which has cytochrome P-450 monocxygenase 
activity or cytochrome P-450 monocxygenase- like activity. 
A homologue of the invention will generally have at least 
90%, at least 95%, at least 98% or at least 99% sequence 
5 identity to the sequence of SEQ ID NO: 5, 7 or 9 over a 
region of at least 60, more preferably at least 100 
contiguous nucleotides or most preferably over the full 
length of SEQ ID NO: 5, 7 or 9 (for determination of 
sequence identity see D.J. Lipman, W.R. Pearson. 1985. 

10 Science 227, pl435) . 

Any combination of the above mentioned degrees of 
sequence identity and minimum sizes may be used to define 
polynucleotides of the invention, with the more stringent 
combinations (i.e. higher sequence identity over longer 

15 lengths) being preferred. Thus, for example a polynucleotide 
which has at least 90% sequence identity over 60, fonns one 
aspect of the invention, as does a polynucleotide which has 
at least 95% sequence identity over 100 nucleotides. 

The sequence of SEQ ID NO: 5, 7 or 9 may be modified 

20 by nucleotide substitutions, for example from 1, 2 or 3 to 
10 or 25 substitutions. The polynucleotide of SEQ ID NO: 5, 
7 or 9 may alternatively or additionally be modified by one 
or more insertions and/or deletions and/or by an extension 
at either or both ends. The modified polynucleotide 

25 generally encodes a polypeptide which has methyl oxidase or 
cytochrome P-4 50 monocxygenase activity. 

Degenerate substitutions may be made and/or substitutions 
may be made which would result in a conservative amino acid 
substitution when the modified sequence is translated, for 

3 0 example as shown in the Table below. 

Polynucleotides of the invention include fragments 
of a sequence set out in SEQ ID NO: 5, 7 or 9 . Thus, 
polynucleotides of the invention may be used as a primer, 
e.g. a PGR primer, a primer for an alternative amplification 

35 reaction, a probe e.g. labeled with a revealing label by 
conventional means using radioactive or non- radioactive 
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labels, or the polynucleotides may be cloned into vectors 
(M.A. Innis et al.,1990. PCR Protocols, Academic Press Inc) . 

Such primers, probes and other fragments will 
preferably be at least 10, preferably at least 15 or at 
5 least 20, for example at least 25, at least 30 or at least 
40 nucleotides in length. They will typically be up to 40, 
50, 60, 70, 100, or 150 nucleotides in length. Probes and 
fragments can be longer than 150 nucleotides in length, for 
example up to 200, 300, 400, 500, 600, 700 nucleotides in 

10 length, or even up to a few nucleotides, such as five or ten 
nucleotides, short of the full length of the sequence of SEQ 
ID NO: 5, 7 or 9. 

Polynucleotides such as DNA polynucleotide and 
primers according to the invention may be produced 

15 recombinant ly, synthetically, or by any means available to 
those of skill in the art. They may also be cloned by 
standard techniques. The polynucleotides are typically 
provided in isolated and/or purified form. 

In general, primers will be produced by synthetic 

20 means, involving a stepwise manufacture of the desired 

nucleic acid sequence one nucleotide at a time. Techniques 
for accomplishing this using automated techniques are 
readily available in the art. 

Longer polynucleotides will generally be produced 

25 using recombinant means, for example using PCR (polymerase 
chain reaction) cloning techniques. 

Although in general the technicjues mentioned herein 
are well known in the art, reference may be made in 
particular to Sambrook et al, 1989, Molecular Cloning: a 

30 laboratory manual. 

A polypeptide of the invention comprises the amino 
acid sequence set out in SEQ ID NO: 6, 8 or 10 or a 
substantially homologous sequence, or a fragment of the said 
sequences and typically has methyl oxidase or cytochrome P- 

35 450 monooxygenase activity. In general, the naturally 
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occurring amino acid sequence shown in SEQ ID NO: 6, 8 or 10 
is preferred. 

A polypeptide of the invention may comprise: 

(a) the polypeptide sequence of SEQ ID NO: 2, 4, 6, 8, 
5 10 or 12; or 

(b) a homologue or fragment thereof. 

A homologue may occur naturally, for example, in a 
bacterium and will function in a substantially similar 
manner to the protein of SEQ ID NO: 6, 8 or 10, for example 

10 it acts as a methyl oxidase in the case of a homolocfue of 

SEQ ID NO: 6 or a cytochrome P-450 monooxygenase in the case 
of a homolocfue of SEQ ID NO: 8 or 10. 

Homologues can be obtained by following the 
procedures described herein for the production of the 

15 polypeptides of SEQ ID NO: 6, 8 or 10 and performing such 

procedures on a suitable cell source e.g. a bacterial cell. 
It will also be possible to use a probe as defined above to 
probe libraries made from bacterial cells in order to obtain 
clones encoding homologues. The clones can be manipulated by 

20 conventional techniques to generate a polypeptide of the 
invention which can then be produced by recombinant or 
synthetic techniques known per se. 

A homologue of a polypeptide of the invention 
preferably has at least 80% sequence identity to the protein 

25 of SEQ ID NO: 6, 8 or 10, or more preferably at least 90%, 

at least 95%, at least 97% or at least 99% sequence identity 
thereto over a region of at least at least 40, preferably at 
least 60, for instance at least 100 contiguous amino acids 
or over the full length of SEQ ID NO: 6, 8 or 10. 

30 The sequence of the polypeptide of SEQ ID NO: 6, 8 

or 10 and of homologues can thus be modified to provide 
polypeptides of the invention. Amino acid substitutions may 
be made, for example from 1, 2 or 3 to 10 or 20. 
substitutions. The modified polypeptide generally retains 

35 activity as a methyl oxidase or cytochrome P-450 

monooxygenase. Conservative substitutions may be made, for 
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example according to the following Table. Amino acids in the 
same block in the second column and preferably in the same 
line in the third column may be substituted for each other. 



ALIPHATIC 


won-poiar 


GAP 


1 L V 


Fo±ar-uncnargea 


C" S 'i' M 


W Q 


Polar- cnargea 


D E 


K K 


AROMATIC 


H f w y 



Polypeptides of the invention also include fragments 
of the above-mentioned full length polypeptides. Such 
fragments typically retain activity as a methyl oxidase or 
cytochrome P-450 monooxygenase . 

10 Polynucleotides of the invention can be incorporated 

into a recombinant replicable vector. The vector may be used 
to replicate the nucleic acid in a compatible host cell. 
Thus, in a further embodiment, the invention provides a 
method of making polypeptides of the invention by 

15 introducing a polynucleotide of the invention into a 

replicable vector, introducing the vector into a compatible 
host cell, and growing the host cell under conditions which 
bring about replication of the vector. The vector may be 
recovered from the host cell . 

20 Preferably, a polynucleotide of the invention in a 

vector is operably linked to a control sequence which is 
capable of providing for the expression of the coding 
sequence by the host cell, i.e. the vector is an expression 
vector. The term "operably linked" refers to a juxtaposition 

25 wherein the components described are in a relationship 
permitting them to function in their intended manner. A 
regulatory sequence, such as a promoter, "operably linked" 
to a coding sequence is positioned in such a way that 
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expression of the coding sequence is achieved under 
conditions compatible with the regulatory sequence. 

Vectors of the invention may be transformed into a 
suitable host cell to provide for expression of a 
5 polypeptide of the invention. Thus, the invention provides a 
process for preparing a polypeptide according to the 
invention which comprises cultivating a host cell 
transformed or transfected with an expression vector 
encoding the polypeptide and recovering the polypeptide. 

10 Each of the genes ORFl, ORF2 and ORF3 can be used 

for various purposes separately or in combination. This will 
be discussed in detail below. 

Targeted inactivation of one or more of the present 
genes, e.g. through marker insertion or replacement with a 

15 non- functional gene equivalent, interferes with at least one 
(oxidation) step in the Pimaricin biosynthetic route. This 
results in the production of modified Pimaricin molecules 
characterized by a different oxidative state. For example, 
molecules can be created lacking the epoxide function at 

20 carbons C4 and C5 , or molecules with a modified oxidation 

state of the carboxyl group at C12 resulting in an aldehyde, 
alcohol, or methyl group at this position. 

Disruption of chromosomally encoded genes can be 
accomplished by gene replacement strategies. Gene 

25 replacement is preferably carried out using suicide plasmid 
vectors or defective phage vectors carrying modified target 
genes and detection or selection marker genes. The various 
elements useful for such strategies, and how to employ them, 
are described below. 

30 Target gene modification can be accomplished by 

disruption of a coding sequence by insertion or deletion of 
nucleotides or nucleotide stretches. Such insertions or 
deletions may be of any suitable size. Preferably, they are 
of a size of at least 2 nucleotides, for example up to 5, up 

35 to 10, up to 25 or up to 50 nucleotides in length, excepting 
deletions which are multiples of 3. Alternatively, the 
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coding region of the target gene may be replaced by that of 
a (marker) gene. This confers an easily detectable phenotype 
on cells transformed with such a construct . Suitable 
examples of replacement genes are lacZ, xylE, Green 
5 Fluorescent Protein, and genes for the biosynthesis of 

antibiotics, such as erythromycin, apramycin, hygromycin, 
and thiostrepton, and metabolite analogues, such as 
f luoroacetamide . 

Transfer of a disrupted target gene to a Pimaricin 

10 production host, resulting in in vivo gene inactivation, can 
be accomplished by using e.g. suicide vector systems, a 
defective phage containing a fragment internal to the coding 
region of the target gene, or a variant of the gene 
inactivated through deletion or insertion of DNA stretches 

15 as described above, and optionally a detection or selection 
marker. Suicide vectors and defective phages are 
characterized by their inability to propagate autonomously 
in the strain to be transformed and thus cannot be stably 
maintained by themselves. For Streptoinycetes in general 

2 0 several suicide systems are available and suicide vectors 

can be chosen from the group of extrachromosomal element 
based cloning vectors available for E. coli, which cannot 
replicate in Streptomyces species, including for example 
pBR322, pUC, CoID, RSFlOlO, RK2 and vectors derived from 
25 these plasmids . Similarly, Streptomyces plasmids 

characterized by a limited host range can be selected that 
are incapable of stable maintenance in the desired host 
strain. Examples of such narrow host range plasmids are 
SLP1.2 and SCP2 , and cloning vectors derived from these 

3 0 plamids. Still another possibility is to use temperature 

sensitive variants of Streptomyces wide host range plasmids. 
These plamids are characterized by their inability to 
replicate above a certain (restrictive) temperature. Besides 
non-replicative plasmids, defective phage vectors have been 
35 developed based on the Streptomyces phage phiC31 and have 
proven extremely useful for genetic analysis. In this 
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regard, it is noted that an extensive overview of known 
Streptomyces genetic engineering techniques may be found in 
Hopwood et al . (D.A. Hopwood, M.J. Bibb, K.F. Chater, T. 
Kieser, C.J. Bruton, H.M. Kieser, D.J. Lydiate, CP. Smith, 
5 J.M. Ward, H. Schrempf, Genetic Manipulation of 
Streptomyces: A Laboratory Manual, The John Innes 
Foundation, Norwich, England, 1985) . 



introduced in a desired host cell using transformation 

10 procedures with isolated DNA, by conjugation from a donor 
microorganism, e.g. an E. colx or Streptomyces strain 
harboring the construct, or via transfection by phage 
particles. All of these methods are well within the 
knowledge of the person skilled in the art. 

15 Upon introduction of such a construct in the 

microorganism of interest, e.g. Streptomyces natalensis , 
stable maintenance of the introduced genetic information is 
only possible by integration of the construct in the host 
chromosome, preferably by homologous recombination with the 

20 chromosomal copy of the target gene. Strains having 

integrated the construct in the chromosome can be detected 
by the expression of a co- introduced marker. In case of a 
detection marker, transformed colonies can be screened for 
acquired properties such as conversion of a colorless 

25 substrate into a colored compound (applicable with e.g. the 
genes lacZ, or xylE) or fluorescence (by expression of e.g. 
Green Fluorescent Protein) . Alternatively, a marker can be 
used which allows selection of transformed strains by 
acquired resistance to e.g. antibiotics or toxic metabolite 

30 analogues. The latter method usually is employed more 

frequently because only cells with the acquired resistance 
will be able to grow in media containing the antibiotic or 
toxic metabolite analogue. If an internal fragment of the 
target gene is used for the construction of the suicide 

35 vector or defective phage, integration of the construct into 
the chromosomal copy of the target gene will result in 



The above mentioned suicide constructs can be 
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inactivation immediately. If the suicide construct or 
defective phage contains the complete target gene or a 
fragment including the N- terminal or C- terminal coding 
region, though inactivated through smaller insertions or 
5 deletions, only integration of the construct will result in 
the presence of an active and' inactive copy of the gene, 
separated by vector DNA. For obtaining a strain with only an 
inactive copy, a second homologous recombination has to take 
place removing the vector sequences and the active copy of 

10 the target gene. Strains having undergone this second 

homologous recombination can be detected by the loss of the 
acquired property encoded by the co- introduced marker gene. 

Another application of the present genes from the 
Pimaricin gene cluster lies in overexpression of one or more 

15 of these genes in the natural host, Streptomyces nata^lensls . 
The expression of the individual genes within the cluster is 
tightly regulated by the cell physiology and/or cluster 
specific regulatory genes. This internal control may be 
appropriate for production of the antibiotic in the natural 

20 environment, but is undesirable for industrial production. 
Overexpression of all genes of the cluster by introduction 
of additional gene copies or altering the controlling 
elements (e.g. promoters or regulatory genes) can boost 
antibiotic production considerably. This has been shown for 

25 e.g. Actinorhodin production by Streptomyces coelicolor. A 
similar effect can be obtained by overexpression, 
specifically of those genes encoding enzymes representing 
rate limiting steps in antibiotic biosynthesis. 

Additional copies of each of the present genes from 

3 0 the Pimaricin biosynthesis gene cluster or homologues or 
fragments thereof, either separately or in different 
combinations, can be introduced into StrBptowyces 
nat^lensis . This increases the efficiency of the oxidative 
reactions leading to biosynthesis of the natural Pimaricin 

35 molecule, and results in strains displaying improved 

Pimaricin production. This increase may be in the form of 
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increased Pimaricin titre in the culture broth or a higher 
product yield on substrate consumed. Of course, enhanced 
expression of certain genes can also be combined with 
inactivation of other genes, thus effecting improved 
5 production of variants of Pimaricin as described above. 

Strains containing additional copies of target genes 
can be obtained through introduction of complete genes 
including expression signals (promoters and optionally 
enhancers) into the host chromosome. Suitable techniques 

10 include suicide vectors and defective phage, as described 

above. Alternatively, autonomously replicating DNA molecules 
derived from phage genomes or extrachromosomal elements, for 
example plasmids, can be used to carry the additional genes. 
Suitable cloning vectors include those derived from plasmids 

15 pIJlOl and SCP2 . Other vectors can be constructed based on 
the plasmid naturally occurring in Streptoznyces natalensis, 
as disclosed in GB patent application nr 2210619, using 
selection and/or detection markers similar to those employed 
for the pIJlOl derived vectors, such as pIJ702, pIJ486, with 

20 or without added markers as described above. 

For gene expression, a large variety of promoters 
efficiently directing transcription of genes in Streptomyces 
is available- An example of a constitutive promoter is the 
ermE promoter, which directs expression of the earythromycin 

25 resistance gene from Saccharopolyspora erythraea. By 

contrast the agarase gene promoter from S . coelicolor, the 
promoter of the glycerol utilization operon, or the tipA 
promoter are examples of promoters inducible by specific 
substrates. Using techniques known in the art additional 

30 promoters can be obtained, e.g. promoters endogenous to 

S .natalensis (see J.M.Ward, G.R.Janssen, T.Kieser, M.J.Bibb, 
M.J.Buttner, M.J.Bibb. 1986. Mol . Gen . Genet . 203: 468-478). 

The degree of overexpression can be manipulated by 
the choice of the promoter, by the amount of inducing 

35 compound, or by the choice of the autonomously replicating 
vector systems. Depending on the vector derivative used. 
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predetermined plasmid copy numbers can range from 1 or 2 to 
about 500. It is well within the expertise of the normal 
person skilled in the art to adjust the vector system to the 
desired degree of overexpression . 
5 Both of the above uses of polynucleotides of the 

invention, i.e. inactivation to obtain new variants of 
Pimaricin and overexpression to increase Pimaricin 
productivity, can also be applied to strains producing 
structurally similar bioactive compounds for instance 

10 polymer antibiotics such as Amphotericin B {Streptomyces 

nodosus) , Nystatin {Streptomyces noursel) (see Figure 4) to 
obtain variants of these compounds and/or to improve 
productivity Using the present genes to inactivate the 
corresponding genes in Streptomyces species other than 

15 Streptomyces natalensis will result in new derivatives of, 

inte:r alia, nystatin and amphotericin B which are altered in 
their oxidative state. 

A further application of the polynucleotides of the 
invention is the heterologous expression and exploitation of 

20 the enzymatic activity encoded by one or more of the said 
polynucleotides. Using similar vector systems as employed 
for overexpression of the oxidative genes in S .natalensis , 
other microorganisms, preferably Strep toirrycetes species for 
instance the strain Streptomyces lividans or Stjreptoinyces 

25 coelicolor, can be genetically transformed and thus acquire 
new oxidative enzymatic activity. This route is particularly 
useful for application of the enzymatic activities of 
polypeptides of the invention to the oxidative modification 
of other, preferable bioactive, compounds. Examples include 

30 secondary metabolites, antibiotics and anticancer agents 
etc., which often are highly f unctionalized chemical 
entities. Thus, it is possible to introduce one or more of 
the polynucleotides of the invention into a host producing 
such bioactive compounds naturally, or one which has 

35 acquired the genetic information to produce compounds by 

recombinant DNA technology. A strain having acquired a gene 
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or genes encoding oxidative enzymatic activity from the 
Pimarcin biosynthetic gene cluster will then be able to 
introduce, for example, epoxide functions or alcohol, 
aldehyde, or carboxyl groups into metabolites previously not 
5 modified in such a way. In this way it is possible to 
oxidize a methyl group which is not part of an linear 
alkane . A methyl group forming part of an aliphatic ring of 
an organic compound or biocompound can be oxidized by one or 
more of the polypeptides of the invention. The polypeptides 

10 of the invention can be isolated or purified from rDNA 
transformed hosts in which one or more of the 
polynucleotides of the invention are introduced. Preferably 
the polynucleotide are heterologous to the host. But also 
the transformed host as such may be used for the oxidative 

15 conversion. Thus, an approach has been provided, which 
allows for the creation of new variants of bioactive 
compounds not obtainable by chemical means (exemplified in 
Example 6 below) , 

The invention will now be demonstrated by the 

20 following, non-restrictive examples. 

Ex^unpleg 

Example 1. Isolation and identification of Pimaricin 
25 biosynthetic genes, 

Streptomyces nat^lensis strain ATCC27448 was grown 
in YEME medium (D.A. Hopwood, M.J. Bibb, K.F. Chater, T. 
Kieser, C.J. Bruton, H.M. Kieser, D.J. Lydiate, CP. Smith, 

30 J.M. Ward, H. Schrempf, Genetic Manipulation of 

Streptomyces: A Laboratoiry Manual, The John Innes 
Foundation, Norwich, England, 1985) at 30^C for 3 days. 
Mycelium was harvested and total DNA was extracted and 
purified essentially as described by Hopwood (ibid.). 

35 Total S.natalensis DNA was subjected to partial 

digestion with the restriction enzyme Sau3AI and size 
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fractionated on 0.8% agarose gel. Fragments of 3 0-40 kbp 
were isolated, inserted into BanMl digested cosmid Supercosl 
and subsequently introduced in E.coli strain XLl-Blue MR 
according to protocols suggested by the supplier 
5 (Stratagene, La Jolla) . 

Thus, a cosmid library of S.natalensis DNA in E.colx 
was obtained. The cosmid library was screened for the 
presence of polyketide synthase (PKS) related sequences by 
hybridization with radioactively labeled fragments from 

10 known PKS genes from the Rapamycin biosynthesis cluster from 
Streptomyces hygroscoplcus (T.Schwecke, J. F. Aparicio, 
Y.Molnar, A.Konig, L.E.Khaw, S.F.Haydock, M.Oliynyk, 
P.Caffrey, J.Cortes, J.B.Lester, G.A.Bohm, J.Staunton, 
P.F.Leadlay. 1995. Proc . Natl. Acad. Sci . USA 92: 783 9- 

15 7843) . 

Several clones were isolated which contained 
sequences hybridizing to a fragment containing the KS module 
5 of rapB . 

Complete DNA sequence determination of a number of 
20 neighbouring Notl fragments from Cos9 was performed after 
cloning the fragments in pBluescript. Computer assisted 
analysis of the DNA sequences revealed the presence of genes 
clearly identifiable as PKS gene modules on the basis of 
nucleotide and derived amino acid sequence homology with 
25 established PKS genes and proteins involved in the 

biosynthesis of erythromycin and rapamycin, as well as with 
fatty acid synthase genes and proteins, which catalyze a 
similar set of reactions. The complete nucleotide sequences 
and derived amino acid sequences of two Pimaricin PKS genes 
30 are given as SEQ ID numbers 1-4. 

Using a 0.7 kb Notl fragment from Cos9 as a probe, 
the extent of the PKS related genes on the cosmid map was 
established as indicated in Figure 1. 
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Example 2 , PKS genes are essential for Pimaricin 
^j.opynthgsig 

A completely sequenced 3.3 kb Notl DNA fragment (see 
5 Figure 1) (in pBluescript) , encoding part of a S . natalensis 
PKS as deduced form the organizational and structural 
sequence similarities with knovm PKS, was excised by Sad 
from the sequencing vector. The fragment was subcloned into 
the phage vector KC515 (M.R.Rodicio, C.J.Bruton, K.F.Chater. 

10 1985. Gene 34: 283-292) and introduced in S.llvldans to 

obtain infectious particles (recombinant phage) containing 
the S .natalensis PKS fragment. Infection of S .natalensis 
using this recombinant phage population and selection for 
resistance to the antibiotic viomycin, allowed the isolation 

15 of lysogens, originated through integration of the 

recombinant phage DNA into the S .natalensis chromosomal DNA 
by homologous recombination of the PKS regions. 

None of 2 0 lysogens tested displayed antifungal 
activity as analyzed by an agar plate bioassay using Candida 

20 utiiis as the indicator organism. Detailed analysis of one 
of the lysogens by Southern hybridization studies confirmed 
that integration of the recombinant phage DNA into the 
S .natalensis chromosomal PKS locus had indeed occurred. 

Culturing the lysogen with the disrupted PKS gene in 

25 standard production medium (25 g/1 soya peptone, 0.5 mM 

ZnSO^, 20 g/1 glucose, pH 7.5) followed by extraction of the 
culture broth with butanol, and UV spectrophotometric 
analysis indicated that no traces of Pimaricin were produced 
by this lysogen (J.F.Martin, A.L.Demain. 1975. Biochem. 

30 Biophys. Res. Commun. 71: 1103-1109). 

Example 3 . Detailed sequpnce analysis of non^PKS 
genes: preliminary identification. 



35 



Full sequence analysis of the regions flanking the 
PKS genes of Example 1 revealed the presence of additional 
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open reading frames (ORF) potentially encoding proteins 
functional in Pimaricin biosynthesis. 

Homology comparison of the deduced amino acids 
sequences of the ORFs indicated the involvement of several 
5 in oxidation/reduction reactions. ORFl showed a clear 

homology with previously identified cholesterol oxidases and 
ORF2 and ORFS were similar to cytochrome P-450 monooxygenase 
proteins. Also, genes encoding accessory proteins for the P- 
450 enzymes seem to be present i.e. ferredoxin type. 
10. Complete nucleotide sequences of the respective genes and 

derived amino acid sequences are added as SEQ ID numbers 5- 
10. Detailed info3rmation on the chromosomal regions 
enompassing the three open reading frames (ORF's) is 
presented in Figure 2 . 

15 

igyaTTipi^ 4. Functional characterization of non-PKS genes 
involved in Pimaricin biosvnthesis . 

To define the involvement of the accessory 

20 genes/proteins in Pimaricin biosynthesis, both ORFl and ORF3 
were disrupted and the effect on Pimaricin production 
established. Similar strategies as described in Example 2 
for the PKS disruption were employed for the non-PKS genes. 
ORFl : a 7kb Sphl fragment containing the complete ORFl was 

25 cloned into pUC19, the resulting plasmid was digested with 
Bglll, the cohesive ends were filled in by treatment with 
Klenow polymerase and religated. This new plasmid was used 
as a source for DNA for the gene replacement. The 2.9 kb 
BamHI-Pstl fragment from the plasmid was cloned into the 

30 BaniRl'Pstl sites of KC515. The recombinant phage was 

propagated in S. llvidans, and used to infect the wildtype 
S,natalensis strain. Lysogens were obtained by selection for 
thiostrepton. The second recombination event was searched 
for by the loss of thiostrepton resistance. The insertion 

35 and subsequent loss of the phage as well as the final 

structure of the disruptred gene was confirmed by Southern 
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hybridization. 

ORF3 : disruption was accomplished by insertion of a 
667 bp Pvull-Smal fragment internal to ORF3 in HinCII cut 
5 pUC19; The fragment was excised using BamHI and PstI and 
ligated into similarly digested phage vector KC515. 
Transformation of the ligation mixture to S. lividans yielded 
recombinant phage 06D4 - Iparticles . After transfection of 
S .xiatalensis , lysogens were isolated as described above. 
10 Disruption of ORF3 in S .natal ens is mutant D4 was confirmed 
by Southern hybridization 

Exaw>1e Analysis of ORFl and ORF3 aene di simp t an ts of 

15 

Strains with disrupted ORFl and 0RF3 were analyzed 
for pimaricin production using the bioassay with C.utilis . 
For both disruptants the production of an antifungal 
activity was strongly reduced as compared with the wild-type 
20 strain S . natalensis ATCC27448. 

Both strains were cultured in pimaricin production medium 
(see Example 2) and the culture filtrate was analyzed by 
combined liquid chromotography/mass spectroscopy (LC-MS) 
analysis . 

25 Disruptants in ORFl did not contain any pimaricin- 

like molecule in the culture filtrate. 

In the case of the ORF3 disruptant a single Pimaricin- like 
molecule was detected in the culture filtrate having 
molecular mass of 649.75 indicating the loss of exactly 1 

3 0 oxygen atom. The exact structure was determined by NMR 

spectroscopy to be identical to Pimaricin except that the 
epoxide function at was replaced by a double bond; the 
structure with a double bond between C^-C^ (displayed in 
Figure 3b (top) ) is the expected biosynthetic precursor for 

35 the epoxidation. 
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Example 6. Ovez-expression of ORFl , ORF2 , and ORF^ iT. 
S.natalensls . 

Separate overexpression of ORFl , ORF2 and 0RF3 was 
achieved by placing each gene under the direction of the 
erTTiE promoter from Saccharopolyspora erythraea (M.J. Bibb, 
G.R. Janssen, J.M. Ward. 1985. Gene 38: 215-226). A useful 
derivative of this promoter, having a number of cloning 
sites attached was obtained by PGR using the following 
oligonucleotides: SEQ ID 11: 
AAACTGCAGCTCTAGAGGCGGCTTGCGCCCGATGCTAGTC 

SEQ ID 12: 

AAACTGCAGCTCTAGATGCCCGGGTATCGATCGTCGACGGCATGCGGATCCTACCAACCG 
GCACGATTG 

The 225 bp PGR fragment obtained was digested with 
PstI, purified by agarose gel electrophoresis and inserted 
into PstI digested pUC19, yielding pUCermE 

ORFl was inserted in pUGermE as a 2 . 2 kb Sphl-Clal 
fragment encompassing the complete coding sequence. For ORF2 
a 3.5 kb Clal-Nrul fragment was used, and for ORF3 a 2.8 kb 
Sail- Kpnl fragment was used. Each ermE promoter-ORF 
combination was subsequently excised as a PstI fragment, 
inserted in PstI digested phage vector KG515 and introduced 
in S.natalensis essentially as described in Example 4. 

Recombinant S , natal gzis is strains were thus obtained 
which overexpressed one of the three genes . Each strain 
showed improved levels of Pimaricin production of 10 -15 % 
after growth under standard production conditions (see 
Example 2) . 

Example 7. Expression of S.natalensis QR Fl . ORF2 , and QRF3 
in S.caelicolGJ- ^rtA lividans 



ORFl and ORF2 : A 223 bp Ndel-EcoRl fragment. 
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corresponding to the 5 'end of ORFl from the ATG to the first 
EcoRl site was obtained using the Polymerase Chain Reaction 
such that an Ndel site was created coinciding with the ATG 
initiation codon of ORFl. The oligonuleot ides used for this 
5 PGR we re 5 ' - AGGATTACCCATATGTTCGAGAACCAGCAT - 3 ' ( f orwar d ; SEQ 
ID NO 13) and 5 ' -GCATGAGCGTGGGAATTCCG -3" (reverse; SEQ ID 
NO 14) . The PGR product was digested with Ndel and EcoRl 
cloned into similarly digested vector pT7-7 (S. Tabor, C.C. 
Richardson. 1985. PNAS 82, 1074) to yield plasmid pJA56. 

10 pJA56 was digested with EcoRl and Smal , and ligated 

to an EcoRl'Nrul fragment encompassing ORFl and 0RF2, 
yielding plasmid pJA57 . 

pJA57 was digested with Ndel and ligated to Ndel- 
digested pIJ6021 (E .Takano et al. 1995. Gene 166, 133). The 

15 resulting plasmid was named pJA58. Both ORFl and ORF2 are 
now under the direction of the thiostrepton inducible tipA 
promoter. Plasmid pJA5 8 was transformed into strain 
S. coelicoloz- A (3) 2 and S.lividans 1326. 

ORF3 : The ORF3 expression vector has been construe- 

20 ted by cloning a 3.7 kb Kpnl fragment containing the 

complete ORF3 into the unique Kpnl site of pHZ13 51 (Bao et 
al.. 1997. ISBA Meeting abstract 4P15) . The resulting 
plasmid (pJA50) was transformed to strain S . coelicolor A(3)2 
and S . llvidans 1326. Expression of ORF3 is directed by its 

25 own promoter. 

Example 8. Activity of cell^free extracts of S^coelicolor 
expressing ORFl. ORF2 . and ORF3 , 

30 S . coellcolor strains expressing the genes ORFl and 

2, and ORF3 , respectively, thus producing the active 
proteins pORFl, pORF2 , and pORF3 were grown in YEME medium 
(Hopwood et al . , ibid) . For induction thiostrepton was added 
to 0.005mg/l. Incubation was for 4 8 hrs . at 3 0''C. 

35 Cell-free extracts were prepared as follows: 

Mycelium was harvested by centrif ugation at 5000xg/4**C for 
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10 minutes and washed with 1 volume of 50mM Tris-HCl pH 7.5, 
ImM DTT, 10% glycerol. The mycelium was resuspended in 0.2 
volume of 50mM Tris-HCl pH 7.5, ImM DTT, 10% glycerol; 1 
tablet of protease inhibitor cocktail (Boehringer Mannheim) 
5 was added per 25 ml of extract. Cell extracts were prepared 
by sonication. After sonicatibn cell debris were removed by 
centrif ugation at lOOOOxg / 4^*0 for 10 minutes. 

Activity assays for the cell -free extracts were 

10 performed using S.coelicolor cell-free extract (lOO-lOOO/xg 
total protein); 0.5 ^imol NADPH; 5 /xmol glucose -6 -phosphate ; 
0.5 U glucose -6 -phosphate dehydrogenase; 22ug spinach 
ferredoxin; 0.05 U spinach ferredoxin NADP+ reductase. As 
substrate for the oxidation activities triketide lactone 

15 (TKL, see Figure 5; M.J.B. Brown et al . 1995. J.Chem.Soc. 

Chem.Comm. 1517; CM. Kao et al.. 1995. J. Am. Chem. Soc . 117, 
9105) was added. After allowing to react for 60-90 minutes, 
the products were extracted twice with an equal volume of 
ethylacetate, and analysed by thin layer chromatography, LC- 

2 0 MS, and NMR spectroscopy. 

It appeared that pORF3 was inactive on this specific 
substrate, but that the combined action of pORFl and pORF2 
resulted in a TKL derivative having the methyl group comple- 
tely oxidized to the carboxylic acid function (see Figure 

25 5) . 
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Claims 

1. A polynucleotide comprising: 

5 (i) a nucleic acid sequence set out in SEQ ID NO: 5^ 7 or 
9 or a sequence complementairy' thereto; or 

(ii) a homologue or fragment of a sequence defined in (i) . 

2. A polynucleotide according to claim 1 consisting 

10 essentially of the nucleic acid sequence set out in SEQ ID 
NO: 5, 7 or 9 or a sequence complementary thereto. 

3 . A polypeptide encoded by a polynucleotide according to 
claim 1 or 2 . 

15 

4. A polypeptide obtainable by expressing a 
polynucleotide according to claim 1 or 2 in a cell which is 
a StreptomycBs cell or a cell of a heterologous species. 

20 5. A polypeptide comprising the amino acid sequence set 
out in SEQ ID NO: S, 8 or 9 or a homologue or fragment 
thereof . 

6 . A recombinant cell comprising at least one additional 
25 copy of a polynucleotide according to claim 1 or 2, wherein 

the cell naturally possesses at least one said 
polynucleotide . 

7. A recombinant cell according to claim S, wherein the 
30 cell is one which naturally produces pimaricin or a related 

molecule . 

8 . A recombinant cell according to claim 7 which is a 
Streptoinyces ncLtalensls cell. 

35 
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9. A recombinant cell, wherein a polynucleotide according 
to claim 1 or 2 which naturally occurs in the cell has been 
inactivated . 

5 10. A recombinant cell according to claim 9, wherein the 
cell is one which naturally produces pimaricin or a related 
molecule . 

11- A recombinant cell according to claim 10 which is a 
10 5treptoinyces natalensis cell. 

12. A recombinant cell comprising a polynucleotide 
according to claim 1 or 2 which polynucleotide does not 
naturally occur in that cell or where the polynucleotide is 

15 heterologous to that cell . 

13. A recombinant cell according to claim 12, wherein the 
cell is one which does not naturally produce pimaricin. 

20 14. A recombinant cell according to claim 13 which is a 
Streptomyces lividans or Streptomyces coelicolor cell. 

15. A method for overexpressing a polynucleotide encoding 
a polypeptide according to any one of claims 3 to 5 in a 

25 Streptomyces cell which method comprises: 

(i) attaching a promoter sequence to the said 
polynucleotide ; 

(ii) transferring the resulting promoter-polynucleotide 
complex into the said cell; and 

30 (iii) maintaining the resulting cell under conditions 
suitable for expression of the said polynucleotide. 

16. A method for inactivating a polynucleotide encoding a 
polypeptide according to any one of claims 3 to 5 in a 

35 Streptomyces cell which method comprises disrupting the 
coding sequence of the said polynucleotide. 
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17. A method for expressing a polynucleotide encoding a 
polypeptide according to any one of claims 3 to 5 in a 
heterologous cell which method comprises: 

5 (i) attaching a promoter sequence to the said 
polynucleotide ; 

(ii) transferring the resulting promoter-polynucleotide 
complex into the said cell; and 

(iii) maintaining the resulting cell under conditions 
10 suitable for expression of the said polynucleotide. 

18 . A method for producing pimaricin which method 
comprises maintaining a cell according to any one of claims 
6 to 8 under conditions suitable for obtaining expression of 

15 the additional copy of a polynucleotide according to claim 1 
or 2 and isolating the said pimaricin. 

19. A method for producing a biomolecule which method 
comprises maintaining a cell according to any one of claims 

20 9 to 11 under conditions which would be suitable for 

obtaining expression of the inactivated polynucleotide had 
it not been inactivated and isolating the said biomolecule. 

20. A method for producing a biomolecule which method 

25 comprises maintaining a cell according to any one of claims 
12 to 14 under conditions suitable for obtaining expression 
of the polynucleotide which does not naturally occur in the 
cell and isolating the said biomolecule. 

30 21. A biomolecule obtainable by a method according to 
claim 19 or 20. 



35 



22 . Use of a recombinant cell according to any one of 
claims 6 to 8 in the production of pimaricin. 
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23 . Use of a recombinant cell according to any one of 
claims 9 to 14 in the production of a biomolecule. 

24. A vector containing a polynucleotide according to 
claim 1 or 2 which is capable of expressing a polypeptide 
according to any one of claims 3 to 5 . 

25, A cell harbouring a vector according to claim 24. 



10 26. A method for producing a polypeptide according to any 
one of claims 3 to 5, which method comprises maintaining a 
cell according to claim 25 under conditions suitable for 
obtaining expression of the polypeptide and isolating the 
said polypeptide . 

15 

27. Use of an isolated or purified polypeptide according 
to any one of claims 3 to 5 for the oxidative modification 
of a methylgroup of a suitable compound. 



20 
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Figure 2 
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Figure 5 
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SEQUENCE LISTING 

<110> DSM N,V. 

<120> Genes encoding enzymes in the biosynthesis of pimaricin and applications 
thereof 

<130> S-2959 

<140> 
<141> 

<160> 14 

<170> Patentin Ver . 2.1 

<210> 1 
<211> 5544 
<212> DNA 

<213> Streptomyces natalensis 

<220> 

<221> CDS 

<222> (1) . . (5544) 

<400> 1 

atg gtg ccc gtc cac aca gat gac tac gcg ate cag ccg ccc gcc gac 48 

Met Val Pro Val His Thr Asp Asp Tyr Ala lie Gin Pro Pro Ala Asp 
15 10 15 

acc gcg cat gga ggg ggt ggc ttc aca ctg cct get gtg tte gag gee 96 
Thr Ala His Gly Gly Gly Gly Phe Thr Leu Pro Ala Val Phe Glu Ala 
20 25 30 

gcc gtg gaa tec gcc ccc gac gca gtg gee etc gtc gac ggc acc gtt 144 
Ala val Glu Ser Ala Pro Asp Ala Val Ala Leu Val Asp Gly Thr Val 
35 40 45 

cct gga cct ggg cgc atg tgg cgc gca gac gtc gat gcg ctg gcc cgc 192 
Pro Gly Pro Gly Arg Met Trp Arg Ala Asp Val Asp Ala Leu Ala Arg 
50 55 60 

ggg ttg cag gag teg ggt ate gca ccc ggt gac gtg gtg gcg gtg egg 240 
Gly Leu Gin Glu Ser Gly lie Ala Pro Gly Asp Val Val Ala Val Arg 
65 70 75 80 

ctg ccg aac tgc ggg agg ttt ccc ace ctg cac ctg gcc gtc gcg gee 28 8 
Leu Pro Asn Cys Gly Arg Phe Pro Thr Leu His Leu Ala Val Ala Ala 
85 90 95 

gtc gga gcg gta ctg ctg ccc ate cac cag ggc acc ccg etc ccg gag 336 
Val Gly Ala Val Leu Leu Pro lie His Gin Gly Thr Pro Leu Pro Glu 
100 105 110 

gtc gac gcg ctg ctg ace egg gcg gaa ccc gee cct etc gtc ctg teg 384 
Val Asp Ala Leu Leu Thr Arg Ala Glu Pro Ala Leu Leu Val Leu Ser 
115 120 125 

gee gee ggg age gac ggc ctg gcg aeg gee egt teg etc ctg gag age 432 
Ala Ala Gly Ser Asp Gly Leu Ala Thr TQa Arg Ser Leu Leu Glu Ser 
130 135 140 
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gtg ccg teg ctg cgc ggc gtg ctg ctg gcc ggg gcg teg gga gae ggg 48 0 
Val Pro Ser Leu Arg Gly Val Leu Leu Ala Gly Ala Ser Gly Asp Gly 
145 150 155 160 

gag tee gga tec gtc gga ggt ggg gag tea gga tec ggg cgt egg teg 52 8 
Glu Ser Gly Ser Val Gly Gly Gly Glu Ser Gly Ser Gly TVrg Arg Ser 
165 170 175 

ctg gac ggg etg ctg gcc ggc tgg gcg ggc age ggg ccc egg ccg gtg 576 
Leu Asp Gly Leu Leu Ala Gly Trp Ala Gly Ser Gly Pro Arg Pro Val 
180 185 190 

gac gtc acc ccc gat atg ccg etc gtt etc gtc ccc teg tec ggg acg 624 
Asp Val Thr Pro Asp Met Pro Leu Val Leu Val Pro Ser Ser Gly Thr 
195 200 205 

gtc teg gca egg eec aaa etg tgt gtg cae age cae gac ggt ctg ctg 672 
Val Ser Ala Arg Pro Lys Leu Cys Val His Ser His Asp Gly Leu Leu 

210 215 220 

teg aac ace gca gee gtc acc gee gag gee gee gae gcc tte gac ggt 720 
Ser Asn Thr Ala Ala Val Thr Ala Glu Ala Ala Asp Ala Phe Asp Gly 
225 230 235 240 

ccg gtc etc acg gcc tgc ccg atg acc cat ctg tte ggc etc cag tec 768 
Pro Val Leu Thr Ala Cys Pro Met Thr His Leu Phe Gly Leu Gin Ser 
245 250 255 

etg cae gcg gcg etg tte gcc gcc tgc acg cag gtc ctg etc acc ggc 816 
Leu His Ala Ala Leu Phe Ala Ala Cys Thr Gin Val Leu Leu Thr Gly 
260 265 270 

tgg gac gtg gae cgc tte etg gaa cag get egg gag cae ggc ccc cgt 864 
Trp Asp Val Asp Arg Phe Leu Glu Gin Ala Arg Glu His Gly Pro Arg 
275 280 285 

gtc gtg tte gcc gtg ccc gcc caa ctg egg gac gtc gtg aca cga etg 912 
Val Val Phe Ala Val Pro Ala Gin Leu Arg Asp Val Val Thr Arg Leu 
290 295 300 

gcc agg acc ggc gaa ccg gcg ggc tte acg ccg tac cag gtg cgt acc 96 0 
Ala Arg Thr Gly Glu Pro Ala Gly Phe Thr Pro Tyr Gin Val Arg Thr 
305 310 315 320 

gcg ggg gee gee gtc gcg ccc gcg etc gee gta egg gta cgc gee gtc 1008 
Ala Gly Ala Ala Val Ala Pro Ala Leu Ala Val Arg Val Arg Ala Val 
325 330 335 

etc gac tgc gaa ctg gtc gtg gtg tgg ggg atg tec gag ate ggc ace 1056 
Leu Asp Cys Glu Leu Val Val Val Trp Gly Met Ser Glu lie Gly Thr 
340 345 350 

ggc acc cgc acc egg gcc cae cae ccc gac ggc tgt gtg gga gag ccg 1104 
Gly Thr Arg Thr Arg Ala His His Pro Asp Gly Cys Val Gly Glu Pro 
355 360 365 

gtc age ggg gtg gac gtg cgc gtc gtc gac gag cac ggc caa gag tgc 1152 
Val Ser Gly Val Asp Val Arg Val Val Asp Glu His Gly Gin Glu Cys 
370 375 380 
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gcg gcg gac gag agg ggt gaa etc cag tac egg ggg cca ggg ttg ttc 12 00 
Ala Ala Asp Glu Arg Gly Glu Leu Gin Tyr Arg Gly Pro Gly Leu Phe 
385 390 395 400 

cgc ggc tat ttc cgc gaa ccg gag ctg acg cgc teg gee etc ace gac 124 8 
Arg Gly Tyr Phe Arg Glu Pro Glu Leu Thr Arg Ser Ala Leu Thr Asp 
405 410 415 

gac ggc tgg ctg egg ace ggc gat etc gcg ace gtc gac gcg gac ggc 1296 
Asp Gly Trp Leu Arg Thr Gly Asp Leu Ala Thr Val Asp Ala Asp Gly 
420 425 430 

gtg gtg gtg ctg cae ggc egg gcg gee gag ctg ate aat acg ggg ggc 1344 
Val Val Val Leu His Gly Arg Ala Ala Glu Leu He Asn Thr Gly Gly 
435 440 445 

egg aag ttc age gee ggc gag gtc gag gga ctg etc tec ggc ttt acg 1392 
Arg Lys Phe Ser Ala Gly Glu Val Glu Gly Leu Leu Ser Gly Phe Thr 
450 455 460 

gac ctg ggc ccg ctg gee gtc gtc ggc gea ccg gac gac egg etc ggg 1440 
Asp Leu Gly Pro Leu Ala Val Val Gly Ala Pro Asp Asp Arg Leu Gly 
465 470 475 480 

gag tac ccg tgc ctg gtc gtg ace gac cae gcg gac ggc ace ate ggc 14 88 
Glu Tyr Pro Cys Leu Val Val Thr Asp His Ala Asp Gly Thr He Gly 
485 490 495 

ctg age gag gtg acc gcg ttc ctg cgc egg ctg ggg etc gee gac cae 1536 
Leu Ser Glu Val Thr Ala Phe Leu Arg Arg Leu Gly Leu Ala Asp His 
500 505 510 

aag ate eee etc gaa ctg gtc ace gtg cgc gag ctt cce ttc tec cct 1584 
Lys He Pro Leu Glu Leu Val Thr Val Arg Glu Leu Pro Phe Ser Pro 
515 520 525 

gee ggg aag etc gac cgc ggg gcg etc aag egg ctg etc gee aac ctt 1632 
Ala Gly Lys Leu Asp Arg Gly Ala Leu Lys Arg Leu Leu Ala Asn Leu 
530 535 540 

gcg gag gtg tec gtt ccg gee cgt etc ggc gcg gtc ccg cce tac acg 1680 
Ala Glu Val Ser Val Pro Ala Arg Leu Gly Ala Val Pro Pro Tyr Thr 
545 550 555 560 

gcc gag gag gcg ctg gac etc gta cgc gac tgc gtc ggc egg gtg etc 1728 
Ala Glu Glu Ala Leu Asp Leu Val Arg Asp Cys Val Gly Arg Val Leu 
565 570 575 

cgt tac ggc ggc gcg gcc gtc cce ttc cce ccg gac aag gac ttc ttc 1776 
Arg Tyr Gly Gly Ala Ala Val Pro Phe Pro Pro Asp Lys Asp Phe Phe 
580 585 590 

tee ccg gac aag gac ttc cgc cag etc ggc ctg gac tec ate ggc gcg 1824 
Ser Pro Asp Lys Asp Phe Arg Gin Leu Gly Leu Asp Ser lie Gly Ala 
595 600 605 

gtg egg ctg egg aat ctg ctg egg gag gag ace ggg ctg ccg etc ccg 1872 
Val Arg Leu Arg Asn Leu Leu Arg Glu Glu Thr Gly Leu Pro Leu Pro 
610 615 620 

gcc ace ctg gee ttc gac tee cce act ccg cga gee gtc gcg cgc gtc 1920 
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Ala Thr Leu Ala Phe Asp Ser Pro Thr Pro Arg Ala Val Ala Arg Val 
625 630 635 640 

ctg gcc gag cag gag gag ccg tec cag gac gag ccg agg gag aac ccg 1968 
Leu Ala Glu Gin Glu Glu Pro Ser Gin Asp Glu Pro Arg Glu Asn Pro 
645 650 655 

gcg gac ggt gcc gac ccg gtg gcg ate gtg ggc atg gcc tgc egg ctg 2016 
Ala Asp Gly Ala Asp Pro Val Ala He Val Gly Met Ala Cys Arg Leu 
660 665 670 

ccg ggc gga gcc gac tec ecc gac gcc ctg tgg gag ctg etc gcc gac 2064 
Pro Gly Gly Ala Asp Ser Pro Asp Ala Leu Trp Glu Leu Leu Ala Asp 
675 680 685 

ggg acc gac gcg atg tec ccc ttc ecc acg gac egc ggc tgg gac ctg 2112 
Gly Thr Asp Ala Met Ser Pro Phe Pro Thr Asp Arg Gly Trp Asp Leu 
690 695 700 

gac egg ctg ttc gac gag gat gcc gac egc ccg ggt acc teg tac gee 2160 
Asp Arg Leu Phe Asp Glu Asp Ala Asp Arg Pro Gly Thr Ser Tyr Ala 
705 710 715 720 

egc gaa ggc ggc ttc ctg cac gac gcg ggc gac ttc gac gcg ggc ttc 2208 
Arg Glu Gly Gly Phe Leu His Asp Ala Gly Asp Phe Asp Ala Gly Phe 
725 730 735 

ttc ggc ctg teg gac cag gag gcg acg gcg acc gat ccg cag cag egg 2256 
Phe Gly Leu Ser Asp Gin Glu Ala Thr Ala Thr Asp Pro Gin Gin Arg 
740 745 750 

Ctg ctt ctg gag gcg gee tgg gag acc ttc gag egg gcg ggc ate gac 2304 
Leu Leu Leu Glu Ala Ala Trp Glu Thr Phe Glu Arg Ala Gly He T^p 
755 760 765 

ccg cag tec ctg agg gga age egt acg ggc gtg ttc acg ggc gcg atg 2352 
Pro Gin Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Thr Gly Ala Met 
770 775 780 

gac egc ggc tat gga ace age gcg tec gcc gcg ccc age gca tgg gag 2400 
Asp Arg Gly Tyr Gly Thr Ser Ala Ser Ala Ala Pro Ser Ala Trp Glu 
785 790 795 800 

age atg etc ate acc ggg acc gee ggc age gcg gtc teg ggg egc ate 244 8 
Ser Met Leu He Thr Gly Thr Ala Gly Ser Ala Val Ser Gly Arg He 
805 810 815 

gee tac acc tac ggg etc gaa ggc ccc gcg ctg acg gtc gac acc gee 2496 
Ala Tyr Thr Tyr Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala 
820 825 830 

tec teg tec tec etc gtc gee ctg eat ctg gcc tgc egg teg ctg egc 2544 
Ser Ser Ser Ser Leu Val Ala Leu His Leu Ala Cys Arg Ser Leu Arg 
835 840 845 

teg ggc gag ace gac ctg gcg ctg gee ggc ggc gtc ace gtc atg gcg 2592 
Ser Gly Glu Thr Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala 
850 855 860 

acc ccg gcg ccc ttc gca cac ttc tec egg ctg egc gcg ctg tec ccc 2640 
Thr Pro Ala Pro Phe Ala His Phe Ser Arg Leu Arg Ala Leu Ser Pro 
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865 870 875 880 

gac tec cgc tec atg gcg tac gcg gac gcc gcg aac ggc teg gcg tgg 2688 
Asp Ser Arg Ser Met Ala Tyr Ala Asp Ala Ala Asn Gly Ser Ala Trp 
885 890 895 

teg gag gge gcg ggg etg ctt ctg etg gag egg ctg age gac gcc egg 2736 
Ser Glu Gly Ala Gly Leu Leu Leu Leu Glu T^g Leu Ser Asp Ala Arg 
900 905 910 

cgc aac gga cac cgt gtc ctg gcg etc gta egg ggc tec gee gtg aat 2784 
Arg Asn Gly His Arg Val Leu Ala Leu Val Arg Gly Ser Ala Val Asn 
915 920 925 

cag gac ggc gcc tec aac ggg etc acc gcg ccg age gga ecc gea eag 2832 
Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin 
930 935 940 

cag ege gtc ate cgc cag gcc ctg gcc gac gcc ggg ctg aeg ccg eag 2880 
Gin Arg Val lie Arg Gin Ala Leu Ala Asp Ala Gly Leu Thr Pro Gin 
945 950 955 960 

gac gtg gac gee gtg gag ggg cae ggc ace ggc aeg ccg etc ggc gac 2928 
Asp Val Asp Ala Val Glu Gly His Gly Thr Gly Thr Pro Leu Gly Asp 
965 970 975 

ecc ate gag gcg cag gcg ctg etg gee aeg tac ggc caa cag egg cet 2976 
Pro lie Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin Gin Arg Pro 
980 985 990 

gtg gaa egg ccg ttg tgg ctg ggg teg gtg aag teg aac ttc ggg cac 3 024 
Val Glu Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Phe Gly His 
995 1000 1005 

aca caa gee gee gee ggg gtc gtc gge gtc ate aag aeg gtg etc gcg 3072 
Thr Gin Ala Ala Ala Gly Val Val Gly Val lie Lys Thr Val Leu Ala 
1010 1015 1020 

ctg cgc cac ggc gtc ctg ccg eag aeg etg cac gtg gac get ecc teg 3120 
Leu Arg His Gly Val Leu Pro Gin Thr Leu His Val Asp Ala Pro Ser 
1025 1030 1035 1040 

gcc aag gtg gac tgg tec gcc ggt teg gta egg etg etg acc gag gcg 3168 
Ala Lys Val Asp Trp Ser Ala Gly Ser Val Arg Leu Leu Thr Glu Ala 
1045 1050 1055 

egg ecc tgg cea egg gag age gga cgt aeg cgc egg gcg ggg gtg tec 3216 
Arg Pro Trp Pro Arg Glu Ser Gly Arg Thr Arg Arg Ala Gly Val Ser 
1060 1065 1070 

teg ttc ggg etc acc ggc acc aac gcg cae gtg ate etg gag gag gcg 3264 
Ser Phe Gly Leu Thr Gly Thr Asn Ala His Val lie Leu Glu Glu Ala 
1075 1080 1085 

ccg gga gag gcg gcg gea ggg gcg egg gcc gag gtt ecc gag gag gcg 3312 
Pro Gly Glu Ala Ala Ala Gly Ala Arg Ala Glu Val Pro Glu Glu Ala 
1090 1095 1100 

egg tge gcc tec tea ccg get cga etc ccg gag ccg ecc ggc gac gcg 3 360 
Arg Cys Ala Ser Ser Pro Ala Arg Leu Pro Glu Pro Pro Gly Asp Ala 
1105 1110 1115 1120 
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gcc gcg ccc tgg gtg ctg tec gcc egg age egg gcg geg ctg cgc gcg 34 08 
Ala Ala Pro Trp Val Leu Ser Ala Arg Ser Arg Ala Ala Leu Arg Ala 
1125 1130 1135 

cag gcg etc cgc ctg gcc gac cag gtg gcc gcc gac ccc ggt eta egg 3456 
Gin Ala Leu Arg Leu Ala Asp Gin Val Ala Ala Asp Pro Gly Leu Arg 
1140 1145 1150 

gee cag gat gtc gee cat. gee ctg gcc ace tec egt ace ctg cac egg 3504 
Ala Gin Asp Val Ala His Ala Leu Ala Thr Ser Arg Thr Leu His Arg 
1155 1160 1165 

cac egg gee gtc gtc age ggc tec gac egg gea eaa atg etc gcc gcg 3552 
His Arg Ala Val Val Ser Gly Ser Asp Arg Ala Gin Met Leu Ala Ala 
1170 1175 1180 

gea aag egg ttc ggg etc ggt gag egg acc gcg ggc gtc ace ccg gac 3600 
Ala Lys Arg Phe Gly Leu Gly Glu Arg Thr Ala Gly Val Thr Pro Asp 
1185 1190 1195 1200 

gat tec gcg ccg ggc ctg ctg gcc ttc gtc ttc tec ggg cag ggc age 364 8 
Asp Ser Ala Pro Gly Leu Leu Ala Phe Val Phe Ser Gly Gin Gly Ser 
1205 1210 1215 

cag cgc age ggc atg ggg cgc gcg gcg gcc gag gcg ttc ccg gtc ttc 3696 
Gin Arg Ser Gly Met Gly Arg Ala Ala Ala Glu Ala Phe Pro Val Phe 
1220 1225 1230 

gga egg gcg ctg ggc gag gtg tgc gee gcg ctg gac ccg ctg ctg aca 3744 
Gly Arg Ala Leu Gly Glu Val Cys Ala Ala Leu Asp Pro Leu Leu Thr 
1235 1240 1245 

cgc eca ctg acc teg gtg atg tgg gcg get ccc ggc tec gag gag gcg 3792 
Arg Pro Leu Thr Ser Val Met Trp Ala Ala Pro Gly Ser Glu Glu Ala 
1250 1255 1260 

gcc cgt etc gac gac acc acc tac aeg cag ccc gcc ctg ttc gcc gtc 3840 
Ala Arg Leu Asp Asp Thr Thr Tyr Thr Gin Pro Ala Leu Phe Ala Val 
1265 1270 1275 1280 

cag gtc gee ctg tac egg ctg ttc gag tee tgg ggc gtg gtg ccg gac 3 888 
Gin Val Ala Leu Tyr Arg Leu Phe Glu Ser Trp Gly Val Val Pro Asp 
1285 1290 1295 

cag ctg gtg ggg eat teg gtc ggc gag ate tec gee gee eat gtg gea 3 936 
Gin Leu Val Gly His Ser Val Gly Glu lie Ser Ala Ala His Val Ala 
1300 1305 1310 

ggc gtg etc ggc etc egg gac gcg tgc acc ctg gtg gcg gcc cgt age 3984 
Gly Val Leu Gly Leu Arg Asp Ala Cys Thr Leu Val Ala Ala Arg Ser 
1315 1320 1325 

^SS ctg atg ggc gcg ctg ccg ccc ggc ggt geg atg gtg gcg gta cgc 4032 
Arg Leu Met Gly Ala Leu Pro Pro Gly Gly Ala Met Val Ala Val Arg 
1330 1335 1340 

ate aeg gaa ccc gaa gtg acc eca tgg etc gcg gag ttg aeg gac gag 4080 
He Thr Glu Pro Glu Val Thr Pro Trp Leu Ala Glu Leu Thr Asp Glu 
1345 1350 1355 1360 
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gtg teg ate gcg gee gtc aac ggt ccg cac tec etc gtg etc geg ggc 4128 
Val Ser lie Ala Ala Val Asn Gly Pro His Ser Leu Val Leu Ala Gly 
1365 1370 1375 

gee gag gee eeg etc gte gee ete aeg gac egg etc gee gee gee gga 4176 
Ala Glu Ala Pro Leu Val Ala Leu Thr Asp Arg Leu Ala Ala Ala Gly 
1380 1385 1390 

cae aag acc egg ege etc atg gtg age ace geg eee eac teg eeg ctg 4224 
His Lys Thr Arg Arg Leu Met Val Ser Thr Ala Pro His Ser Pro Leu 
1395 1400 1405 

atg gac ccc atg ctg gag gag ttc cgc gcg gtc gtc cgc acg ctg tec 4272 
Met Asp Pro Met Leu Glu Glu Phe Arg Ala Val Val Arg Thr Leu Ser 
1410 1415 1420 

tae gee geg eee gee gtt ccc etc gtc tec acc gtc acc ggc cgc ccg 4320 
Tyr Ala Ala Pro Ala Val Pro Leu Val Ser Thr Val Thr Gly Arg Pro 
1425 1430 1435 1440 

etg ace ggc gag gag gcg cgc gac eeg gac eae tgg gtg egg eat gtg 4368 
Leu Thr Gly Glu Glu Ala Arg Asp Pro Asp His Trp Val Arg His Val 
1445 1450 1455 

egg cag tec gtc cgc ttc aag gac gcg ate ggc egg etc egg gac gaa 4416 
Arg Gin Ser Val Arg Phe Lys Asp Ala lie Gly Arg Leu Arg Asp Glu 
1460 1465 1470 

cgc gte ace ggg ttc ctg gag ctg ggt gee gaa ccg gea etc aca ccg 4464 
Arg Val Thr Gly Phe Leu Glu Leu Gly Ala Glu Pro Ala Leu Thr Pro 
1475 1480 1485 

atg ate gac gag tge etg gag tec gee gac ggg cag ccc ggg ace gee 4512 
Met lie Asp Glu Cys Leu Glu Ser Ala Asp Gly Gin Pro Gly Thr Ala 
1490 1495 1500 

ctg gtg ccg agt ctg cgc gee gga gtg eeg gag egg gat gee ctg etc 4560 
Leu Val Pro Ser Leu Arg Ala Gly Val Pro Glu Arg Asp Ala Leu Leu 
1505 1510 1515 1520 

acc geg gtc gee egg gtg cac gcc cag ggc gtt ccc gtc gac tgg gac 4608 
Thr Ala Val Ala Arg Val His Ala Gin Gly Val Pro Val Asp Trp Asp 
1525 1530 1535 

geg gtg etc ccc ggg gee gag get tec gtc acc gtg cgc gga ctg ccc 4656 
Ala Val Leu Pro Gly Ala Glu Ala Ser Val Thr Val Arg Gly Leu Pro 
1540 1545 1550 

gee gcc gac cgc cag tgg ttc cgc ttc gtc ccc gac cag ggc gcg ccg 4704 
Ala Ala Asp Arg Gin Trp Phe Arg Phe Val Pro Asp Gin Gly Ala Pro 
1555 1560 1565 

etc acc etc gcc gac cgc teg ctg cac ctg gag ggc gcc gcc cac etc 4 752 
Leu Thr Leu Ala Asp Arg Ser Leu His Leu Glu Gly Ala Ala His Leu 
1570 1575 1580 

cgc gac gtg ggc ggc tgt cgc ace gcc gac ggc egg tgg gtg aaa atg 4800 
Arg Asp Val Gly Gly Cys Arg Thr Ala Asp Gly Arg Trp Val Lys Met 
1585 1590 1595 1600 

ggc gtc ctg tae cgc acc aac aat ctg cac geg etc ace gac gee gac 4 848 
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Gly Val Leu Tyr Arg Thr Asn Asn Leu His Ala Leu Thr Asp Ala Asp 
1605 1610 1615 



ctg gcc aag etc cag cgc etc ggc ate cgt acg gac ttc gac ctg cgc 4 8 96 
Leu Ala Lys Leu Gin Arg Leu Gly lie Arg Thr Asp Phe Asp Leu Arg 
1620 1625 1630 

atg ccg ggc gag cgc gcc aaa gcc ccg aac cgc gtg ccc acg ggc gcg 4 944 
Met Pro Gly Glu Arg Ala Lys Ala Pro Asn Arg Val Pro Thr Gly Ala 
1635 1640 1645 

cgc tat ate gtc gcc gat gcc ttt gac gca cat ctg egg gac ggg ctg 4 992 
Arg Tyr He Val Ala Asp Ala Phe Asp Ala His Leu Arg Asp Gly Leu 
1650 1655 1660 

agt ctg age gag egg gac ctg gag egg ctg cgc gag gcc egg ctg tec 5040 
Ser Leu Ser Glu Arg Asp Leu Glu Arg Leu Arg Glu Ala Arg Leu Ser 
1665 1670 1675 1680 

ggt etc gac gag gcc gaa cag cac gcg etc gta ctg gee ctg gtc etc 508 8 
Gly Leu Asp Glu Ala Glu Gin His Ala Leu Val Leu Ala Leu Val Leu 
1685 1690 1695 

gee gag ace teg gee gtg etc ggc ggc cag gag acg ect ggc gag gag 513 6 
Ala Glu Thr Ser Ala Val Leu Gly Gly Gin Glu Thr Pro Gly Glu Glu 
1700 1705 1710 

ccg cac ggc gag gag ggc cac cgc acg ttc aag gag atg ggc ate aac 5184 
Pro His Gly Glu Glu Gly His Arg Thr Phe Lys Glu Met Gly He Asn 
1715 1720 1725 

teg ttg aac gee gtc gaa ctg cgc aac cgc ett ate gca gee acg gac 5232 
Ser Leu Asn Ala Val Glu Leu Arg Asn Arg Leu He Ala Ala Thr Asp 
1730 1735 1740 

Ctg egg ett ccc gcc acg etc gtc tac gac tac ccc acg ccg aag gcc 52 8 0 
Leu Arg Leu Pro Ala Thr Leu Val Tyr Asp Tyr Pro Thr Pro Lys Ala 
1745 1750 1755 1760 

gtc gtc ega etc gta cgc gaa ega etc gcg cga ccg gcc tec ccc gca 5328 
Val Val Arg Leu Val Arg Glu Arg Leu Ala Arg Pro Ala Ser Pro Ala 
1765 1770 1775 

cgc gat gtg gcc tec gtc gtg gee gag ttg gag tec ctg ctg acg gcc 53 76 
Arg Asp Val Ala Ser Val Val Ala Glu Leu Glu Ser Leu Leu Thr TQa 
1780 1785 1790 

ggt gcg gag gtc teg gag gag acc gcg gcg egg ttg aaa gcg gtg acg 5424 
Gly Ala Glu Val Ser Glu Glu Thr Ala Ala Arg Leu Lys Ala Val Thr 
1795 1800 1805 

gcg gtg teg acg ggg acg acg ggg teg ggc age ggt acg ggc gcc ggc 54 72 
Ala Val Ser Thr Gly Thr Thr Gly Ser Gly Ser Gly Thr Gly Ala Gly 
1810 1815 1820 

tec ggc ggg get ctg gac ctg gta teg gcc agt gac gag gaa ctg ttc 552 0 
Ser Gly Gly Ala Leu Asp Leu Val Ser Ala Ser Asp Glu Glu Leu Phe 
1825 1830 1835 1840 

egg ctg atg gac gcg gag age tga 5544 
Arg Leu Met Asp Ala Glu Ser 
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1845 



<210> 2 
<211> 1848 
<212> PRT 

<213> Streptomyces natalensis 
<400> 2 

Met Val Pro Val His Thr Asp Asp Tyr Ala lie Gin Pro Pro Ala Asp 
15 10 15 

Thr Ala His Gly Gly Gly Gly Phe Thr Leu Pro Ala Val Phe Glu Ala 
20 25 30 

Ala Val Glu Ser Ala Pro Asp Ala Val Ala Leu Val Asp Gly Thr Val 

35 40 45 

Pro Gly Pro Gly Arg Met Trp Arg Ala Asp Val Asp Ala Leu Ala Arg 
50 55 60 

Gly Leu Gin Glu Ser Gly lie Ala Pro Gly Asp Val Val Ala Val Arg 
65 70 75 80 

Leu Pro Asn Cys Gly Arg Phe Pro Thr Leu His Leu Ala Val Ala Ala 
85 90 95 

Val Gly Ala Val Leu Leu Pro lie His Gin Gly Thr Pro Leu Pro Glu 
100 105 110 

Val Asp Ala Leu Leu Thr Arg Ala Glu Pro Ala Leu Leu Val Leu Ser 
115 120 125 

Ala Ala Gly Ser Asp Gly Leu Ala Thr Ala Arg Ser Leu Leu Glu Ser 
130 135 140 

Val Pro Ser Leu Arg Gly Val Leu Leu Ala Gly Ala Ser Gly Asp Gly 
145 150 155 160 

Glu Ser Gly Ser Val Gly Gly Gly Glu Ser Gly Ser Gly Arg Arg Ser 
165 170 175 

Leu Asp Gly Leu Leu Ala Gly Trp Ala Gly Ser Gly Pro Arg Pro Val 
180 185 190 

Asp Val Thr Pro Asp Met Pro Leu Val Leu Val Pro Ser Ser Gly Thr 
195 200 205 

Val Ser Ala Arg Pro Lys Leu Cys Val His Ser His Asp Gly Leu Leu 
210 215 220 

Ser Asn Thr Ala Ala Val Thr Ala Glu Ala Ala Asp Ala Phe Asp Gly 
225 230 235 240 

Pro Val Leu Thr Ala Cys Pro Met Thr His Leu Phe Gly I^eu Gin Ser 
245 250 255 

Leu His Ala Ala Leu Phe Ala Ala Cys Thr Gin Val Leu Leu Thr Gly 
260 265 270 

Trp Asp Val Asp Arg Phe Leu Glu Gin Ala Arg Glu His Gly Pro Arg 
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275 



280 



285 



Val Val Phe Ala Val Pro Ala Gin Leu Arg Asp Val Val Thr Arg lieu 
290 295 300 

Ala Arg Thr Gly Glu Pro. Ala Gly Phe Thr Pro Tyr Gin Val Arg Thr 
305 310 315 320 

Ala Gly Ala Ala Val Ala Pro Ala Leu Ala Val Arg Val Arg Ala Val 
325 330 335 

Leu Asp Cys Glu Leu Val Val Val Trp Gly Met Ser Glu lie Gly Thr 
340 345 350 

Gly Thr Arg Thr Arg Ala His His Pro Asp Gly Cys Val Gly Glu Pro 
355 360 365 

Val Ser Gly Val Asp Val Arg Val Val Asp Glu His Gly Gin Glu Cys 
370 375 380 . 

Ala Ala Asp Glu Arg Gly Glu Leu Gin Tyr Arg Gly Pro Gly Leu Phe 
385 390 395 400 

Arg Gly Tyr Phe Arg Glu Pro Glu Leu Thr Arg Ser Ala Leu Thr Asp 
405 410 415 

Asp Gly Trp Leu Arg Thr Gly Asp Leu Ala Thr Val Asp Ala Asp Gly 
420 425 430 

Val Val Val Leu His Gly Arg P^la Ala Glu Leu lie Asn Thr Gly Gly 
435 440 445 

Arg Lys Phe Ser Ala Gly Glu Val Glu Gly Leu Leu Ser Gly Phe Thr 
450 455 460 

Asp Leu Gly Pro Leu Ala Val Val Gly Ala Pro Asp Asp Arg Leu Gly 
465 470 475 480 

Glu Tyr Pro Cys Leu Val Val Thr Asp His Ala Asp Gly Thr lie Gly 
485 490 495 

Leu Ser Glu Val Thr Ala Phe Leu Arg Arg Leu Gly Leu Ala Asp His 
500 505 510 

Lys lie Pro Leu Glu Leu Val Thr Val Arg Glu Leu Pro Phe Ser Pro 
515 520 525 

Ala Gly Lys Leu Asp Arg Gly Ala Leu Lys Arg Leu Leu Ala Asn Lieu 
530 535 540 

Ala Glu Val Ser Val Pro Ala Arg Leu Gly Ala Val Pro Pro Tyr Thr 
545 550 555 560 

Ala Glu Glu Ala Leu Asp Leu Val Arg Asp Cys Val Gly Arg Val Leu 
565 570 575 

Arg Tyr Gly Gly Ala Ala Val Pro Phe Pro Pro Asp Lys Asp Phe Phe 
580 585 590 

Ser Pro Asp Lys Asp Phe Arg Gin Leu Gly Leu Asp Ser lie Gly Ala 



595 



600 



605 
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Val Arg Leu Arg Asn Leu Leu Arg Glu Glu Thr Gly Leu Pro Leu Pro 
610 615 620 

Ala Thr Leu Ala Phe Asp Ser Pro Thr Pro Arg Ala Val Ala Arg Val 
625 630 635 640 

Leu Ala Glu Gin Glu Glu Pro Ser Gin Asp Glu Pro Arg Glu Asn Pro 
645 650 655 

Ala Asp Gly Ala Asp Pro Val Ala lie Val Gly Met Ala Cys Arg Leu 
660 665 670 

Pro Gly Gly Ala Asp Ser Pro Asp Ala Leu Trp Glu Leu Leu Ala Asp 
675 680 685 

Gly Thr Asp Ala Met Ser Pro Phe Pro Thr Asp Arg Gly Trp Asp Leu 
690 695 700 

Asp Arg Leu Phe Asp Glu Asp Ala Asp Arg Pro Gly Thr Ser Tyr Ala 
705 710 715 720 

Arg Glu Gly Gly Phe Leu His Asp Ala Gly Asp Phe Asp Ala Gly Phe 
725 730 735 

Phe Gly Leu Ser Asp Gin Glu Ala Thr Ala Thr Asp Pro Gin Gin Arg 
740 745 750 

Leu Leu Leu Glu Ala Ala Trp Glu Thr Phe Glu Arg Ala Gly lie Asp 
755 760 765 

Pro Gin Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Thr Gly Ala Met 
770 775 780 

Asp Arg Gly Tyr Gly Thr Ser Ala Ser Ala Ala Pro Ser Ala Trp Glu 
785 790 795 800 

Ser Met Leu lie Thr Gly Thr Ala Gly Ser Ala Val Ser Gly Arg lie 
805 810 815 

Ala Tyr Thr Tyr Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala 
820 825 830 

Ser Ser Ser Ser Leu Val Ala Leu His Leu Ala Cys Arg Ser Leu Arg 
835 840 845 

Ser Gly Glu Thr Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala 
850 855 860 

Thr Pro Ala Pro Phe Ala His Phe Ser Arg Leu Arg Ala Leu Ser Pro 
865 870 875 880 

Asp Ser Arg Ser Met Ala Tyr Ala Asp Ala Ala Asn Gly Ser Ala Trp 
885 890 895 

Ser Glu Gly Ala Gly Leu Leu Leu Leu Glu Arg Leu Ser Asp Ala Arg 
900 905 910 

Arg Asn Gly His Arg Val Leu Ala Leu Val Arg Gly Ser Ala Val Asn 
915 920 925 
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Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin 
930 935 940 

Gin Arg Val lie Arg Gin Ala Leu Ala Asp Ala Gly Leu Thr Pro Gin 
945 950 955 960 

Asp Val Asp Ala Val Glu Gly His Gly Thr Gly Thr Pro Leu Gly Asp 
965 970 975 

Pro lie Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin Gin Arg Pro 
980 985 990 

Val Glu Arg Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Phe Gly His 
995 1000 1005 

Thr Gin Ala Ala Ala Gly Val Val Gly Val lie Lys Thr Val Leu Ala 
1010 1015 1020 

Leu Arg His Gly Val Leu Pro Gin Thr Leu His Val Asp Ala Pro Ser 
025 1030 1035 1040 

Ala Lys Val Asp Trp Ser Ala Gly Ser Val Arg Leu Leu Thr Glu Ala 
1045 1050 1055 

Arg Pro Trp Pro Arg Glu Ser Gly Arg Thr Arg Arg Ala Gly Val Ser 
1060 1065 1070 

Ser Phe Gly Leu Thr Gly Thr Asn Ala His Val lie Leu Glu Glu Ala 
1075 1080 1085 

Pro Gly Glu Ala Ala Ala Gly Ala Arg Ala Glu Val Pro Glu Glu Ala 
1090 1095 1100 

Arg Cys Ala Ser Ser Pro Ala Arg Leu Pro Glu Pro Pro Gly Asp Ala 
105 1110 1115 1120 

Ala Ala Pro Trp Val Leu Ser Ala Arg Ser Arg Ala Ala Leu Arg Ala 
1125 1130 1135 

Gin Ala Leu Arg Leu Ala Asp Gin Val Ala Ala Asp Pro Gly Leu Arg 
1140 1145 1150 

Ala Gin Asp Val Ala His Ala Leu Ala Thr Ser Arg Thr Leu His Arg 
1155 1160 1165 

His Arg Ala Val Val Ser Gly Ser Asp Arg Ala Gin Met Leu Ala Ala 
1170 1175 1180 

Ala Lys Arg Phe Gly Leu Gly Glu Arg Thr Ala Gly Val Thr Pro Asp 
185 1190 1195 1200 

Asp Ser Ala Pro Gly Leu Leu Ala Phe Val Phe Ser Gly Gin Gly Ser 
1205 1210 1215 

Gin Arg Ser Gly Met Gly Arg Ala Ala Ala Glu Ala Phe Pro Val Phe 
1220 1225 1230 

Gly Arg Ala Leu Gly Glu Val Cys Ala Ala Leu Asp Pro Leu Leu Thr 
1235 1240 1245 

Arg Pro Leu Thr Ser Val Met Trp Ala Ala Pro Gly Ser Glu Glu Ala 
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1250 



1255 



1260 



Ala Arg Leu Asp Asp Thr Thr Tyr Thr Gin Pro Ala Leu Phe Ala Val 
265 1270 1275 1280 

Gin Val Ala Leu Tyr Arg Leu Phe Glu Ser Trp Gly Val Val Pro Asp 
1285 1290 1295 

Gin Leu Val Gly His Ser Val Gly Glu lie Ser Ala Ala His Val Ala 
1300 1305 1310 

Gly Val Leu Gly Leu Arg Asp Ala Cys Thr Leu Val Ala Ala Arg Ser 
1315 1320 1325 

Arg Leu Met Gly Ala Leu Pro Pro Gly Gly Ala Met Val Ala Val Arg 
1330 1335 1340 

lie Thr Glu Pro Glu Val Thr Pro Trp Leu Ala Glu Leu Thr Asp Glu 
345 1350 1355 1360 

Val Ser lie Ala Ala Val Asn Gly Pro His Ser Leu Val Leu Ala Gly 
1365 1370 1375 

Ala Glu Ala Pro Leu Val Ala Leu Thr Asp Arg Leu Ala Ala Ala Gly 
1380 1385 1390 

His Lys Thr Arg Arg Leu Met Val Ser Thr Ala Pro His Ser Pro Leu 
1395 1400 1405 

Met Asp Pro Met Leu Glu Glu Phe Arg Ala Val Val Arg Thr Leu Ser 
1410 1415 1420 

Tyr Ala Ala Pro Ala Val Pro Leu Val Ser Thr Val Thr Gly Arg Pro 
425 1430 1435 1440 

Leu Thr Gly Glu Glu Ala Arg Asp Pro Asp His Trp Val Arg His Val 
1445 1450 1455 

Arg Gin Ser Val Arg Phe Lys Asp Ala lie Gly Arg Leu Arg Asp Glu 
1460 1465 1470 

Arg Val Thr Gly Phe Leu Glu Leu Gly Ala Glu Pro Ala Leu Thr Pro 
1475 1480 1485 

Met lie Asp Glu Cys Leu Glu Ser Ala Asp Gly Gin Pro Gly Thr Ala 
1490 1495 1500 

Leu Val Pro Ser Leu Arg Ala Gly Val Pro Glu Arg Asp Ala Leu Leu 
505 1510 1515 1520 

Thr Ala Val Ala Arg Val His Ala Gin Gly Val Pro Val Asp Trp Asp 
1525 1530 1535 

Ala Val Leu Pro Gly Ala Glu Ala Ser Val Thr Val Arg Gly Leu Pro 
1540 1545 1550 

Ala Ala Asp Arg Gin Trp Phe Arg Phe Val Pro Asp Gin Gly Ala Pro 
1555 1560 1565 

Leu Thr Leu Ala Asp Arg Ser Leu His Leu Glu Gly Ala Ala His Leu 
1570 1575 1580 
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Arg Asp Val Gly Gly Cys Arg Thr Ala Asp Gly Arg Trp Val Lys Met 
585 1590 1595 * 1600 

Gly Val Leu Tyr Arg Thr Asn Asn Leu His Ala Leu Thr Asp Ala Asp 
1605 1610 1615 

Leu Ala Lys Leu Gin Arg Leu Gly lie Arg Thr Asp Phe Asp Leu Arg 
1620 1625 1630 

Met Pro Gly Glu Arg Ala Lys Ala Pro Asn Arg Val Pro Thr Gly Ala 
1635 1640 1645 

Arg Tyr lie Val Ala Asp Ala Phe Asp Ala His Leu Arg Asp Gly Leu 
1650 1655 1660 

Ser Leu Ser Glu Arg Asp Leu Glu Arg Leu Arg Glu Ala Arg Leu Ser 
665 1670 1675 1680 

Gly Leu Asp Glu Ala Glu Gin His Ala Leu Val Leu Ala Leu Val Leu 
1685 1690 1695 

Ala Glu Thr Ser Ala Val Leu Gly Gly Gin Glu Thr Pro Gly Glu Glu 
1700 1705 1710 

Pro His Gly Glu Glu Gly His Arg Thr Phe Lys Glu Met Gly lie Asn 
1715 1720 1725 

Ser Leu Asn Ala Val Glu Leu Arg Asn Arg Leu lie Ala Ala Thr Asp 
1730 1735 1740 

Leu Arg Leu Pro Ala Thr Leu Val Tyr Asp Tyr Pro Thr Pro Lys Ala 
745 1750 1755 1760 

Val Val Arg Leu Val Arg Glu Arg Leu Ala Arg Pro Ala Ser Pro Ala 
1765 1770 1775 

Arg Asp Val Ala Ser Val Val Ala Glu Leu Glu Ser Leu Leu Thr Ala 
1780 1785 1790 

Gly Ala Glu Val Ser Glu Glu Thr Ala Ala Arg Leu Lys Ala Val Thr 
1795 1800 1805 

Ala Val Ser Thr Gly Thr Thr Gly Ser Gly Ser Gly Thr Gly Ala Gly 
1810 1815 1820 

Ser Gly Gly Ala Leu Asp Leu Val Ser Ala Ser Asp Glu Glu Leu Phe 
825 1830 1835 1840 

Arg Leu Met Asp Ala Glu Ser 
1845 



<210> 3 
<211> 20394 
<212> DNA 

<213> Streptomyces natalensis 

<220> 

<221> CDS 

<222> (1) . . (20394) 
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<400> 3 

atg teg aac gag gag aag ctg cgc gag tac etc aag ege gcg ate gcg 48 

Met Ser Asn Glu Glu Lys Leu T^g Glu Tyr Leu Lys Arg Ala lie Ala 
15 10 15 

gac ctt cac gag act cgt cag caa ttg gac gag acc gag gcg aag cag 96 
Asp Leu His Glu Thr Arg Gin Gin Leu Asp Glu Thr Glu Ala Lys Gin 
20 25 30 

cga gag cce etc gcg ate gtg teg atg gee tgc ege ttc cec ggc ggc 144 
Arg Glu Pro Leu Ala lie Val Ser Met Ala Cys Arg Phe Pro Gly Gly 
35 40 45 

gte cgt teg eee gag gag ctg tgg gag ctg ctg cgc gac ggc gtc gac 192 
Val A.rg Ser Pro Glu Glu Leu Trp Glu Leu Leu Arg Asp Gly Val Asp 
50 55 60 

gcg gtt tec tec ttc cce cgt aac cgc ggc tgg gac ctg gac gcg etc 240 
Ala Val Ser Ser Phe Pro Arg Asn Arg Gly Trp Asp Leu Asp Ala Leu 
€5 70 75 80 

tac cac tee gac ccg gee cac cag ggc acc age tat gcg cgc gag ggc 288 
Tyr His Ser Asp Pro Ala His Gin Gly Thr Ser Tyr Ala Arg Glu Gly 
85 90 95 

gga ttc ctg cat gac gcg ggc gag ttc gac cce ggc ttc ttc ggg ate 336 
Gly Phe Leu His Asp TLLa Gly Glu Phe Asp Pro Gly Phe Phe Gly lie 
100 105 110 

tec ccg cgc gag gcg etc gee atg gac cce cag cag egg ctg ctg ctg 384 
Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 

gag acc gca tgg gaa gee gtc gag egg gee ggt ate gac ccg gag tec 432 
Glu Thr Ala Trp Glu Ala Val Glu Arg Ala Gly lie Asp Pro Glu Ser 
130 135 140 

etc gcg ggc age cga aeg ggt gtc ttt gtc ggc ace ggg cac gga ggg 4 80 
Leu Ala Gly Ser Arg Thr Gly Val Phe Val Gly Thr Gly His Gly Gly 
145 150 155 160 

tac gac gee gag ggc cga egg cgt gee gac gag gtc ggc ggg cac ttg 528 
Tyr Asp Ala Glu Gly Arg Arg Arg Ala Asp Glu Val Gly Gly His Leu 
165 170 175 

ctg aeg ggc aat cac ate age ate gee tee ggc egg att teg tat gte 576 
Leu Thr Gly Asn His lie Ser lie Ala Ser Gly Arg lie Ser T\'r Val 
180 185 190 

ggg ctg gaa ggc cct gee ctg acc gtg gac aeg gee tgc tec teg 624 
Leu Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser 
195 200 205 

teg ctg gte gee ctg cat ctg gcg atg cac gcg ctg egg cgc gac gaa 672 
Ser Leu Val Ala Leu His Leu Ala Met His Ala Leu Arg Arg Asp Glu 
210 215 220 

tgc gee atg gee ctg gtg ggc ggc gcg ace gtg atg tec aeg ccg cag 720 
Cys Ala Met Ala Leu Val Gly Gly Ala Thr Val Met Ser Thr Pro Gin 
225 230 235 240 
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atg ttc gtg gag ttc tec cgc cag cgc ggc ctt gcc ccc gac ggc cgc 768 
Met Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg 
245 250 255 

tgc aag ccg ttc gcg gcc gcc gcc gac ggc acc ggc tgg age gag ggt 816 
Cys Lys Pro Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Ser Glu Gly 
260 265 270 

gtc gga ctg ctg etc gtc gag egg etc agt gac gcc gta cgc aac ggc 864 
Val Gly Leu Leu Leu Val Glu Arg Leu Ser Asp Ala Val Arg Asn Gly 
275 280 285 

tat ccc gtc etc gcc gtg ctg aag ggc teg gcc gtc aac cag gac ggc 912 
Tyr Pro Val Leu Ala Val Leu Lys Gly Ser Ala Val Asn Gin Asp Gly 
290 295 300 

gcg tee aac ggc etg aec gee ccc aac ggc ccc teg cag eaa cge gtc 960 
Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin Arg Val 
305 310 315 320 

ate cgc cag gcg ctg acc ggc gcg ggc etc gcc gcc teg gac ate gac 1008 
lie Arg Gin Ala Leu Thr Gly Ala Gly Leu Ala Ala Ser Asp lie Asp 
325 330 335 

gcc gtg gag gcg cac ggc acc ggc acc acc etc ggc gac ccc gtc gag 1056 
Ala Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Val Glu 
340 345 350 

gcg cae gee etg ctg gee ace tae ggg eag eag ege gee gee gae egg 1104 
Ala His Ala Leu Leu Ala Thr Tyr Gly Gin Gin Arg Ala Ala Asp Arg 

355 360 365 

ccg tgc gga etc ggc tec atg aag tec aac ate ggg cac acc cag gcc 1152 
Pro Cys Gly Leu Gly Ser Met Lys Ser Asn lie Gly His Thr Gin Ala 
370 375 380 

gcc gcc ggt ate gcc ggc gtg atg aaa atg gtc etg gcg atg egg cac 1200 
Ala Ala Gly lie Ala Gly Val Met Lys Met Val Leu Ala Met Arg His 
385 390 395 400 

ggg cae etg eee agg aec etg cac etg gac gag ccc ace ggg cae gtc 1248 
Gly His Leu Pro Arg Thr Leu His Leu Asp Glu Pro Thr Gly His Val 
405 410 415 

gac tgg age gag ggc aac gcc agg etc etc gcg gag ccc gag ccc tgg 1296 
Asp Trp Ser Glu Gly Asn Ala Arg Leu Leu Ala Glu Pro Glu Pro Trp 
420 425 430 

ccg age gcc ggc egg ccc cgt cgc gcc gcc gtc tec tec ttc ggc ate 1344 
Pro Ser Ala Gly Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly lie 
435 440 445 

ggc ace aac gee cae gtc ate ctg gag cag gcg ccc gee cae gag 1392 
Ser Gly Thr Asn Ala His Val He Leu Glu Gin Ala Pro Ala His Glu 
450 455 460 

gee gaa eeg gee eee gaa eeg gee gcc egg eeg ggc gcg etg eee tgg 144 0 
Ala Glu Pro Ala Pro Glu Pro Ala Ala Arg Pro Gly Ala Leu Pro Trp 
465 470 475 480 
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ate ctg tec gcc cgc acc gaa gcg ggc ctg cgt gcc cag gcc gac cgc 1488 
lie Leu Ser Ala Arg Thr Glu Ala Gly Leu Arg Ala Gin Ala Asp Arg 
485 490 495 

etc ggc ege eac eta egg gae ege gee gac etc gaa ccg gee gcc gte 1536 
Leu Gly Arg His Leu Arg Asp Arg Ala Asp Leu Glu Pro Ala Ala Val 
500 505 510 

gcc eat gcg etc gcg gac acc egg acc etc atg gaa cac cgc gcg gta 1584 
Ala His Ala Leu Ala Asp Thr Arg Thr Leu Met Glu His Arg Ala Val 
515 520 525 

gte gte gcg ggc gae egg gag gag tte ctg cgc ggc ctg gac gee etc 1632 
Val Val Ala Gly Asp Arg Glu Glu Phe Leu Arg Gly Leu Asp Ala Leu 
530 535 540 

gcc gcg ggc ege ace gee aae ggc ctg gte age ggc gte gee gte aag 1680 
Ala Ala Gly Arg Thr Ala Asn Gly Leu Val Ser Gly Val Ala Val Lys 
545 550 555 560 

gcc gcc age gcg tte etc tte gee ggg cag ggc tec cag ega ccg ggc 1728 
Ala Ala Ser Ala Phe Leu Phe Ala Gly Gin Gly Ser Gin Arg Pro Gly 
565 570 575 

atg ggg ege gaa ctg eac gee geg cac ccc gtg tte gcc acg gee tte 1776 
Met Gly Arg Glu Leu His Ala AJ.a His Pro Val Phe Ala Thr Ala Phe 
580 585 590 

gac geg gtg tgc gee gaa ctg gae eea cac ctg gac egg ceg ctg cgc 1824 
Asp Ala Val Cys Ala Glu Leu Asp Pro His Leu Asp Arg Pro Leu Arg 
595 600 605 

gac ate gte tte gcc gag gaa ggc age gcc gag gcc gcc ctg etc gac 1872 
Asp lie Val Phe Ala Glu Glu Gly Ser Ala Glu Ala Ala Leu Leu Asp 
610 615 620 

cag ace gee tae acc cag gcc geg etc tte gee ctg gaa ace gee ctg 192 0 
Gin Thr Ala Tyr Thr Gin Ala Ala Leu Phe Ala Leu Glu Thr Ala Leu 
625 630 635 640 

tte egg etc gte gaa tee tgg ggc gtg gca ccc egg tte gte gcc gga 1968 
Phe Arg Leu Val Glu Ser Trp Gly Val Ala Pro Arg Phe Val Ala Gly 
645 650 655 

cac tee ate ggc gag ctg ace gee gee eac gte agt ggc gtg ctg ace 2016 
His Ser lie Gly Glu Leu Thr Ala Ala His Val Ser Gly Val Leu Thr 
660 665 670 

etc cac gac gee gca egg ctg gte gcc gcg cgc ggc acc etc atg cag 2064 
Leu His Asp Ala Ala Arg Leu Val Ala Ala Arg Gly Thr Leu Met Gin 
675 680 685 

gcg ctg ccc gca ggc ggc gee atg gtg gcg gte cag gcc acc gag gac 2112 
Ala Leu Pro Ala Gly Gly Ala Met Val Ala Val Gin Ala Thr Glu Asp 
690 695 700 

gag ate ege gag cgt etc gee ggc eac gag gac cac gte gee etc gcg 2160 
Glu lie Arg Glu Arg Leu Ala Gly His Glu Asp His Val Ala Leu Ala 
705 710 715 720 

gcc gee aae ggg ccc gat tec ace gte att teg ggc gac gaa cag gcc 2208 
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Ala Ala Asn Gly Pro Asp Ser Thr Val lie Ser Gly Asp Glu Gin Ala 
725 730 735 



gtc acc gag ate gcg gcc cac tgg gag gca cag ggc cgc cgc acc aag 2256 
Val Thr Glu lie Ala Ala His Trp Glu Ala Gin Gly Arg Arg Thr Lys 
740 745 750 

egg ctg egg gtc age eac gcc ttc cac tee eeg cae atg gae gac atg 2304 
Arg Leu Arg Val Ser His Ala Phe His Ser Pro His Met Asp Asp Met 
755 760 765 

ctg gag gac ttc egg cgc gtc gcc cgc ggt ctg acc ttc eac gee eee 23 52 
Leu Glu Asp Phe Arg Arg Val Ala Arg Gly Leu Thr Phe His Ala Pro 
770 775 780 

cgc ate ccc gtg gtg tec acg gtg acc ggc gcg etc gcc ace gaa gac 2400 
Arg lie Pro Val Val Ser Thr Val Thr Gly Ala Leu Ala Thr Glu Asp 
785 790 795 800 

gaa ctg ege teg eee gae tae tgg gtg egg cag gtc cgc gaa acc gtc 2448 
Glu Leu Arg Ser Pro Asp Tyr Trp Val Arg Gin Val Arg Glu Thr Val 
805 810 815 

cgc ttc tgt get gcc gtg ege acc ctt gag gee gag ggc gtc ace acc 24 96 
Arg Phe Cys Ala Ala Val Arg Thr Leu Glu Ala Glu Gly Val Thr Thr 
820 825 830 

ttc gtg gag ate ggc acc ggc ggc gtc etc acc ccc atg gtc cag gac 2544 
Phe Val Glu lie Gly Thr Gly Gly Val Leu Thr Pro Met Val Gin Asp 
835 840 845 

tgt ctg ace ace etc gaa gag ccc gtt etc gtc cet ctg ctg cgc acc 2592 
Cys Leu Thr Thr Leu Glu Glu Pro Val Leu Val Pro Leu Leu Arg Thr 
850 855 860 

ggc cgc ccc gaa acc gtc get etc ace gag ggc gtc gcc ace gee ttc 2640 
Gly Arg Pro Glu Thr Val Ala Leu Thr Glu Gly Val Ala Thr Ala Phe 
865 870 875 880 

gtg cac ggt gtc ccc gtc gac egg tec gcc ttc ccg ggc gcg ccc ggt 2688 
Val His Gly Val Pro Val Asp Arg Ser Ala Phe Pro Gly Ala Pro Gly 
885 890 895 

acc tec cgc gcg gac ctg ccc ace tae gee ttc cag cgt cag tgg tae 2736 
Thr Ser Arg Ala Asp Leu Pro Thr Tyr Ala Phe Gin Arg Gin Trp Tyr 
900 905 910 

tgg ctg gac ccg gcc gac cae gae gag ggg gag gcg gcc gcc gcc gaa 2 784 
Trp Leu Asp Pro Ala Asp His Asp Glu Gly Glu Ala Ala Ala Ala Glu 
915 920 925 

gcg ggc gag gcc gga ttc tgg gcg gcc gtc gaa cgc gag gac etc cag 2832 
Ala Gly Glu Ala Gly Phe Trp Ala Ala Val Glu Arg Glu Asp Leu Gin 
930 935 940 

gag ctg teg gcc gtc ctg gcc ate gac ggc age gaa gcg gae tec etc 2880 
Glu Leu Ser Ala Val Leu Ala lie Asp Gly Ser Glu Ala Asp Ser Leu 
945 950 955 960 

ggc age ttc ctg eee acc etc tec tec tgg cgc agg cag cgc agg acc 2928 
Gly Ser Phe Leu Pro Thr Leu Ser Ser Trp Arg Arg Gin Arg Arg Thr 



18 



wo 00/77222 PCT/EPOO/06227 

965 970 975 

cag gcc gcc gcg gac cgc ttc age tac cgc acc cac tgg gcc ccg cgc 2976 
Gin Ala Ala Ala Asp Arg Phe Ser Tyr Arg Thr His Trp Ala Pro Arg 
980 985 990 

acc gcc teg ggc ggc ccc acc gcc acc ggg cac tgg etc gtc gtc ctg 3024 
Thr Ala Ser Gly Gly Pro Thr Ala Thr Gly His Trp Leu Val Val Leu 
995 1000 1005 

ccc gaa ggc ggc acc gac gac ccg tgg acc gcc cgc etc ctg gac gcg 3 072 
Pro Glu Gly Gly Thr Asp Asp Pro Trp Thr Ala Arg Leu Leu Asp Ala 
1010 1015 1020 

ctg aac gac cag ggc ctg cac acc gac gta cgc gaa ctg ccc gcc gac 3120 
Leu Asn Asp Gin Gly Leu His Thr Asp Val Arg Glu Leu Pro Ala Asp 
1025 1030 1035 1040 

cac gag ccc gac gcc tgg ggc cga cac ccc gtg gac ggc gtg etc tgt 3168 
His Glu Pro Asp Ala Trp Gly Arg His Pro Val Asp Gly Val Leu Cys 
1045 1050 1055 

ctg ctg gca etc gac gag egg ccc acc cgc tec tgc cet ccg tac egg 3216 
Leu Leu Ala Leu Asp Glu T^g Pro Thr Arg Ser Cys Pro Pro Tyr Arg 
1060 1065 1070 

cgc ggg ctg gcc gcc acc acc aac get get gcg cgc cct gag ggc gcg 3264 
Arg Gly Leu Ala Ala Thr Thr Asn Ala Ala Ala Arg Pro Glu Gly Ala 
1075 1080 1085 

ggc ate cag gca ccg ctg tgg tgc gtg acc cgc ggc gcc gtc gcc gtc 3312 
Gly lie Gin Ala Pro Leu Trp Cys Val Thr Arg Gly Ala Val Ala Val 
1090 1095 1100 

gac egg cac gag gcg etc aag age ccc tta cag gca cag aca tgg ggc 3360 
Asp Arg His Glu Ala Leu Lys Ser Pro Leu Gin Ala Gin Thr Trp Gly 
1105 1110 1115 1120 

ctg ggc egg gtg gee gcc ctg gag tec ccg cag age tgg ggc ggg etc 34 08 
Leu Gly Arg Val Ala Ala Leu Glu Ser Pro Gin Ser Trp Gly Gly Leu 
1125 1130 1135 

ate gac ctg ccc gac aac ctg gac gga egg gcc gtc tec gcg ctg ctg 3456 
lie Asp Leu Pro Asp Asn Leu Asp Gly Arg Ala Val Ser Ala Leu Leu 
1140 1145 1150 

age acc etc gcc ggg gag gag gac cag gtc gcc gtc cgc ccc gcc ggg 35 04 
Ser Thr Leu Ala Gly Glu Glu Asp Gin Val Ala Val Arg Pro Ala Gly 
1155 1160 1165 

gtc ttc gcc cgc cgc ctg gaa egg ate aca ccc ggc ggc gac acc ggc 3 552 
Val Phe Ala Arg Arg Leu Glu Arg lie Thr Pro Gly Gly Asp Thr Gly 
1170 1175 1180 

gac egg tgg age acc cac ggc acc gtc ctg gtc acc ggc ggc acc ggt 3600 
Asp Arg Trp Ser Thr His Gly Thr Val Leu Val Thr Gly Gly Thr Gly 
1185 1190 1195 1200 

gcc etc ggc gcg cac etc gcc cac tgg ctg gcc gac gcc gga gee gaa 364 8 
Ala Leu Gly Ala His Leu Ala His Trp Leu Ala Asp Ala Gly Ala Glu 
1205 1210 1215 
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cac etc gtg etc acc ggc cgc egc ggc ccg cag gee ccc ggc gea ecg 3696 
His Leu Val Leu Thr Gly Arg Arg Gly Pro Gin Ala Pro Gly Ala Pro 
1220 1225 1230 

gaa etc gcg gcc gcc etc acc gac egg ggc gtc aag gtc acc etc gee 3744 
Glu Leu Ala Ala Ala Leu Thr Asp Arg Gly Val Lys Val Thr Leu Ala 
1235 1240 1245 

gee tgc gac gcc gcc gac cgt gat gcg ctg gcg gcc gtc etc gcg gac 3 7 92 
Ala Cys Asp Ala Ala Asp Arg Asp Ala Leu Ala Ala Val Leu Ala Asp 
1250 1255 1260 

ate eeg ecg eae ctg ccg etg ace ggc gtc gtc eae gee geg gge gta 384 0 
lie Pro Pro His Leu Pro Leu Thr Gly Val Val His Ala Ala Gly Val 
1265 1270 1275 1280 

ctg gac gac ggc gta ctg gac gcg etc ace ccc gag egc ttc gag ace 3888 
Leu Asp Asp Gly Val lieu Asp Ala Leu Thr Pro Glu Arg Phe Glu Thr 
1285 1290 1295 

gta etg cgc ccc aag gcg egg gcc gea cag aac ctg cac gaa etc ace 3 936 
Val Leu Arg Pro Lys Ala Arg Ala Ala Gin Asn Leu His Glu Leu Thr 
1300 ' 1305 1310 

cag gac etc gac etg gac cac ttc gtg etg ttc tee teg ate gtc gge 3984 
Gin Asp Leu Asp Leu Asp His Phe Val Leu Phe Ser Ser lie Val Gly 
1315 1320 1325 

gtc ctg ggc aac gcc gga cag gcc aac tac gee gcc gee aac gcc tac 4032 
Val Leu Gly Asn Ala Gly Gin Ala Asn Tyr Ala Ala Ala Asn Ala Tyr 
1330 1335 1340 

ttg gac gcc etc gcc gaa cac cgt etc gcc cag ggg etc ccg gcc ace 4080 
Leu Asp Ala Leu Ala Glu His Arg Leu Ala Gin Gly Leu Pro Ala Thr 
1345 1350 1355 1360 

tec gtg tec tgg ggc cet ggg cag gcg gcg gea tgg cac gac age gac 412 8 
Ser Val Ser Trp Gly Pro Gly Gin Ala Ala Ala Trp His Asp Ser Asp 
1365 1370 1375 

gee gee gac egg atg age cgc gac gga ctg etg ccc atg gee geg gcc 4176 
Ala Ala Asp Arg Met Ser Arg Asp Gly Leu Leu Pro Met Ala Ala Ala 
1380 1385 1390 

ccg cgt cgc cgc cet gcg eca gcc etc gcc cag ggc atg aca cag gtg 4224 
Pro Arg Arg Arg Pro Ala Pro Ala Leu Ala Gin Gly Met Thr Gin Val 
1395 1400 1405 

acc gtg gcc gac ate gac tgg age gea tac gee ccc gcc ctg ace gcc 4272 
Thr Val Ala Asp lie Asp Trp Ser Ala Tyr Ala Pro Ala Leu Thr Ala 
1410 1415 1420 

gtc cgc ccc age eee etc ate ggc gac ctg ccc gag gea egc cgc gcg 4320 
Val Arg Pro Ser Pro Leu lie Gly Asp Leu Pro Glu Ala Arg Arg Ala 
1425 1430 1435 1440 

etc gge ccc gea gaa gge ccc cgc egg gaa cgc tec ccc etg cgc gac 4368 
Leu Gly Pro Ala Glu Gly Pro Arg Arg Glu Arg Ser Pro Leu Arg Asp 
1445 1450 1455 
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egg ate ggc gca ctg ccg ccc gcc gaa cag gaa aag gca ttc ctg acc 4416 
Arg lie Gly Ala Leu Pro Pro Ala Glu Gin Glu Lys Ala Phe Leu Thr 
1460 1465 1470 

atg gtc agg gaa gag gcc gcg agg gta ctg gga cac ccc teg ccg gac 44 64 
Met Val Arg Glu Glu Ala Ala Arg Val Leu Gly His Pro Ser Pro Asp 
1475 1480 1485 

acc gtc gat gcc caa cgc gcc ttc cgc gag cag ggg ttc gac tec ctg 45X2 
Thr Val Asp Ala Gin Arg Ala Phe Arg Glu Gin Gly Phe Asp Ser Leu 
1490 1495 1500 

atg gcc gtc gac ctg cgc aac egg etc tee g-ce gcg aeg ggc ctg egg 4 560 
Met Ala Val Asp Leu Arg Asn Arg Leu Ser Ala Ala Thr Gly Leu Arg 
1505 1510 1515 1520 

Ctg ccc gee acc ctg ctg ttc gac cae ccc acc ccc ctt gcg gcc gee 46 08 
Leu Pro Ala Thr Leu Leu Phe Asp His Pro Thr Pro Leu Ala Ala Ala 
1525 1530 1535 

gcc tgc ctg cgc tec gaa gtc ctg ggc gcc gca gga ccc gcc aeg gtc 4656 
Ala Cys Leu Arg Ser Glu Val Leu Gly Ala Ala Gly Pro Ala Thr Val 
1540 1545 1550 

gtt cag gca teg acc gcc gee etc gac gaa ccg gtg gcg ate ate ggc 4 704 
Val Gin Ala Ser Thr Ala Ala Leu Asp Glu Pro Val Ala He He Gly 
1555 1560 1565 

atg gcc tgc cgc ttc ccc ggc ggc gtg cae tea ccc gag gcc ctg tgg 4 752 
Met Ala Cys Arg Phe Pro Gly Gly Val His Ser Pro Glu Ala Leu Trp 
1570 1575 1580 

egg ctg ctg gcc gag ggc ggc gac gcc ate acc ccc atg ccc gcc gac 4800 
Arg Leu Leu Ala Glu Gly Gly Asp Ala He Thr Pro Met Pro Ala Asp 
1585 1590 1595 1600 

egg ggc tgg gac ctg gac egg etc tac cae ccc gac ccc gac cae cag 4 84 8 
Arg Gly Trp Asp Leu Asp Arg Leu Tyr His Pro Asp Pro Asp His Gin 
1605 1610 1615 

ggc acc age tac gee cgc ggc ggc ggc ttc ctg gac ggc gcg gee gac 4 896 
Gly Thr Ser Tyr Ala Arg Gly Gly Gly Phe Leu Asp Gly Ala Ala Asp 
1620 1625 1630 

ttc gac gcg gac ttc ttc ggc ate teg ccg cgc gag gcc etc gcc atg 4944 
Phe Asp Ala Asp Phe Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met 
1635 1640 1645 

gac ccg cag cag egg ctg etc ctg gaa aea tgg gag gtg etc gaa cag 4 992 
Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Trp Glu Val Leu Glu Gin 
1650 1655 1660 

gcg ggg ate gac ccg gaa tee ctg egg ggc age age ace ggt gtc ttc 504 0 
Ala Gly He Asp Pro Glu Ser Leu Arg Gly Ser Ser Thr Gly Val Phe 
1665 1670 1675 1680 

gcg ggc acc aac acc cag gac tac ggc aeg gcc ctg gac gcg gca cag 5088 
Ala Gly Thr Asn Thr Gin Asp Tyr Gly Thr Ala Leu Asp Ala Ala Gin 
1685 1690 1695 

gac gaa gcc ggc gga cac egg ccc ace ggc aac gcg atg age gtc gtc 5136 
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Asp Glu Ala Gly Gly His Arg Leu Thr Gly Asn Ala Met Ser Val Val 
1700 1705 1710 

tec ggc egg gtc tec tac ace ttc ggc ttc gag gga ccg gee etc aec 5184 
Ser Gly Arg Val Ser T^'r Thr Phe Gly Phe Glu Gly Pro Ala Leu Thr 
1715 1720 1725 

gtg gac acg gcg tgc tec tec teg etg gtg gee etg eac atg gee geg 5232 
Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Met Ala Ala 
1730 1735 1740 

eag geg etg ege cag ggc gaa tgc tee etg gcg gtc geg ggc ggt gtg 528 0 
Gin Ala Leu Arg Gin Gly Glu Cys Ser Leu Ala Val Ala Gly Gly Val 
1745 1750 1755 1760 

acg gtg atg gcc acc ccg tec tec ttc gtg gag ttc gee egg eag cgc 5328 
Thr Val Met Ala Thr Pro Ser Ser Phe Val Glu Phe Ala Arg Gin Arg 
1765 1770 1775 

ggg etg gee cec gac ggc cgc tgc aag ccg ttc geg gcg gcc gee gae 53 76 
Gly Leu Ala Pro Asp Gly Arg Cys Lys Pro Phe Ala Ala Ala Ala Asp 
1780 1785 1790 

ggc acc ggc tgg age gag ggc gtc ggc etg etg etc gtg gaa egg etc 5424 
Gly Thr Gly Trp Ser Glu Gly Val Gly Leu Leu Leu Val Glu Arg Leu 
1795 1800 1805 

age gac gee cgc cga aae ggc cae cag gtg etc gcc gtc gtc cgc ggt 5472 
Ser Asp Ala Arg Arg Asn Gly His Gin Val Leu Ala Val Val Arg Gly 
1810 1815 1820 

teg gcg gtc aae eag gac ggc gcg tee aae ggt etg age gea cec age 5520 
Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Ser 
1825 1830 1835 1840 

ggc ccg tec cag cag egg gtg ate egg eag gcc etg gcg aae gcc egg 5568 
Gly Pro Ser Gin Gin Arg Val lie Arg Gin Ala Leu Ala Asn Ala Arg 
1845 1850 1855 

gtg gee gee tee gag gtc gac gee gtg gag gee eac ggc acg ggc acc 5616 
Val Ala Ala Ser Glu Val Asp Ala Val Glu Ala His Gly Thr Gly Thr 
1860 1865 1870 

acg etc ggt gae ccg ate gag gee cag geg etg etg gcc acc tac ggc 5664 
Thr Leu Gly Asp Pro lie Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly 
1875 1880 1885 

eag gag egg ccg etg etg etc ggc gcg gtg aag tec aac etc ggc eac 5712 
Gin Glu Arg Pro Leu Leu Leu Gly Ala Val Lys Ser Asn Leu Gly His 
1890 1895 1900 

acc cag gcc gee gee ggt gtg geg ggc gtg atg aag atg gtg etg gcg 576 0 
Thr Gin Ala Ala Ala Gly Val Ala Gly Val Met Lys Met Val Leu Ala 
1905 1910 1915 1920 

atg egg eac ggc atg etg ccg cgc aec etg eac gtc gac gag cec ace 5808 
Met Arg His Gly Met Leu Pro Arg Thr Leu His Val Asp Glu Pro Thr 
1925 1930 1935 

ggg eat gtc gac tgg ace gcg ggc geg gtc gag etg etc acc gag eac 5856 
Gly His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Glu His 
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1940 1945 1950 

acg gac tgg ccc gag acc ggc cac ccc egg cgc gcc gcg gtc ccc gcg 5904 
Thr Asp Trp Pro Glu Thr Gly His Pro Arg Arg Ala Ala Val Ser Ala 

1955 1960 1965 

ttc ggc ate age gge aee aat geg cae gtg gtg ctg gaa ctg cce gea 5 952 
Phe Gly lie Ser Gly Thr Asn Ala His Val Val Leu Glu Leu Pro Ala 
1970 1975 1980 

gee gaa cag ccc ttg gte gaa cag eee teg gee geg gag eee gae geg 6 000 
Ala Glu Gin Pro Leu Val Glu Gin Pro Ser Ala Ala Glu Pro Asp Ala 
1985 1990 1995 2000 

ccg gcc acc get ccc gac egg aeg ccc acc gcc tec gac ggg aeg geg 6 048 
Pro Ala Thr Ala Pro Asp Arg Thr Pro Thr Ala Ser Asp Gly Thr Ala 
2005 2010 2015 

ccg ctg ctg etc tec gcc aag age gag age gcc ctg cgc gcc cag gcg 6 096 
Pro Leu Leu Leu Ser Ala Lys Ser Glu Ser Ala Leu Arg Ala Gin Ala 
2020 2025 2030 

gcc egg etg eae tec eac etg gag egc gac cce gcg etc egg etc aeg 6144 
Ala Arg Leu His Ser His Leu Glu Arg Asp Pro Ala Leu Arg Leu Thr 
2035 2040 2045 

gae gee geg tac acg etg atg aeg eae cgc aeg gee ttc gcc cac cgc 6192 
Asp Ala Ala Tyr Thr Leu Met Thr His Arg Thr Ala Phe Ala His Arg 
2050 2055 2060 

gcg gcc gtc cgc gcc gcc gae eac gaa gcc gcg ctg cgc gcc ctg acc 6240 
Ala Ala Val Arg Ala Ala Asp His Glu Ala Ala Leu Arg Ala Leu Thr 
2065 2070 2075 2080 

gee ctg get gcg ggc gag gee gae ccc gee gtg gac ace gge acc gee 6288 
Ala Leu Ala Ala Gly Glu Ala Asp Pro Ala Val Asp Thr Gly Thr Ala 
2085 2090 2095 

cae aee gge egg gae gee gte etc ttc tee gge cag gga teg caa cgc 6336 
His Thr Gly Arg Asp Ala Val Leu Phe Ser Gly Gin Gly Ser Gin Arg 
2100 2105 2110 

ate gga atg ggc egg gag ttg tec ggc cgc tac ccg gtg ttc gea gag 6384 
lie Gly Met Gly Arg Glu Leu Ser Gly Arg Tyr Pro Val Phe Ala Glu 
2115 2120 2125 

gee ttc gac aee gtg tge geg gee ttg gae gag cat etg gac cgc ccc 6432 
Ala Phe Asp Thr Val Cys Ala Ala Leu Asp Glu His Leu Asp Arg Pro 
2130 2135 2140 

ctg egg gae gtg gte egg ggc gag gae gag gag etg etg aac egg aee 64 80 
Leu Arg Asp Val Val Arg Gly Glu Asp Glu Glu Leu Leu Asn Arg Thr 
2145 2150 2155 2160 

gtc tac gee cag geg ggg ctg ttc gee ate gag gtg gcc etc ttc egg 6528 
Val Tyr Ala Gin Ala Gly Leu Phe Ala lie Glu Val Ala Leu Phe Arg 
2165 2170 2175 

etc gtg gag tec tgg ggc gta egg ccg cac tac gtg gcc ggg cat tec 6576 
Leu Val Glu Ser Trp Gly Val Arg Pro His Tyr Val Ala Gly His Ser 
2180 2185 2190 
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gtc ggc gag ate gcc gcc gcg cac gtc gcc ggg gtg ttc teg ctg gcc 6624 
Val Gly Glu He Ala Ala Ala His Val Ala Gly Val Phe Ser Leu Ala 
2195 2200 2205 

gat gcc tgc gcg ctg gtg gcg gca cgc gga egg ctg atg cag gcg ctg 6672 
Asp Ala Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gin Ala Leu 
2210 2215 2220 

ccc gcc ggc ggc gcg atg gcg gcg ate egg gcg acg gag gac gaa gtc 6 720 
Pro Ala Gly Gly Ala Met Ala Ala He Arg Ala Thr Glu Asp Glu Val 
2225 2230 2235 2240 

etc ccg cac ctg gcg gac age gtc teg ate gcg gcc gtc aac ggc ccg 6768 
Leu Pro His Leu Ala Asp Ser Val Ser He Ala Ala Val Asn Gly Pro 
2245 2250 2255 

teg teg gte gtc gtc tee ggc gee gag eac gee gtg etc tee ate gee 6816 
Ser Ser Val Val Val Ser Gly Ala Glu His Ala Val Leu Ser He Ala 
2260 2265 2270 

gcg cac ttc gag ggc gcg ggc ege aag ace ace agg ctg egg gtc teg 6864 
Ala His Phe Glu Gly Ala Gly Arg Lys Thr- Thr Arg Leu Arg Val Ser 
2275 2280 2285 

cac gcc ttc cac tec ccg etc atg gac ccg atg ctg gcc gac ttc cgc 6912 
His Ala Phe His Ser Pro Leu Met Asp Pro Met Leu Ala Asp Phe Arg 
2290 2295 2300 

gee gte gcc gag ggc etg ace tae ggc gag ccg gag ctg gee gtc gta 6960 
Ala Val Ala Glu Gly Leu Thr Tyr Gly Glu Pro Glu Leu Ala Val Val 
2305 2310 2315 2320 

teg aac gtc ace ggc eaa etc gee ace ccg gac eag etg ege ace ccc 7008 
Ser Asn Val Thr Gly Gin Leu Ala Thr Pro Asp Gin Leu Arg Thr Pro 
2325 2330 2335 

gag tac tgg gtg acc cat gtc cgc gcg gcg gtg cgc ttc gcg gac ggg 7056 
Glu Tyr Trp Val Thr His Val Arg Ala Ala Val Arg Phe Ala Asp Gly 
2340 2345 2350 

ata egg get etg ggg gcg gaa ggg gtg acg egg ttc etc gaa etc ggc 7104 
He Arg Ala Leu Gly Ala Glu Gly Val Thr Arg Phe Leu Glu Leu Gly 
2355 2360 2365 

ccg gac ggc gtc etg teg gee ttg gee agg gag teg gca ccg gac gac 7152 
Pro Asp Gly Val Leu Ser Ala Leu Ala Arg Glu Ser Ala Pro Asp Asp 
2370 2375 2380 

gcc gtg tgc act ccc gtg ctg cgc aag gac cgc tec gag gcg gcg acc 72 00 
Ala Val Cys Thr Pro Val Leu Arg Lys Asp Arg Ser Glu Ala Ala Thr 
2385 2390 2395 2400 

etc etc gcg gcc ctg acg eac etg eac gta cac gga acc gag ate gac 724 8 
Leu Leu Ala Ala Leu Thr His Leu His Val His Gly Thr Glu He Asp 
2405 2410 2415 



tgg ace geg ttc etc gee ggc cgc gac gcg eae gee gte gae etg ccc 
Trp Thr Ala Phe Leu Ala Gly Arg Asp Ala His Ala Val Asp Leu Pro 
2420 2425 2430 



7296 
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acg tac gcc ttc cag cac cag egg ttc tgg ccg acc ccc gac cac acc 7344 
Thr Tyr Ala Phe Gin His Gin Arg Phe Trp Pro Thr Pro Asp His Thr 
2435 2440 2445 

cgc acc ggt gac ctg ggc gcc gtc ggc etc gaa gcg acc ggg cac ccg 73 92 
Arg Thr Gly Asp Leu Gly Ala Val Gly Leu Glu Ala Thr Gly His Pro 
2450 2455 2460 

ctg ctg age gcc gcc gtg gaa ctg ccg gac ggt gag ggc ctg ttg ttc 7440 
Leu Leu Ser Ala Ala Val Glu Leu Pro Asp Gly Glu Gly Leu Leu Phe 
2465 2470 2475 2480 

acc acc cgc etc teg etc cag ace cac ccc tgg ctg gcc ggg cac gtc 7488 
Thr Thr Arg Leu Ser Leu Gin Thr His Pro Trp Leu Ala Gly His Val 
2485 2490 2495 

gtc atg ggc teg gtc ctg ctg ccg ggg acg gcc ttc gcc gaa etc gcc 7536 
Val Met Gly Ser Val Leu Leu Pro Gly Thr Ala Phe Ala Glu Leu Ala 
2500 2505 2510 

etc cgc gcc gcc gac gag gtg ggc tgc gac cgc gtc gac gaa ctg acc 7584 
Leu Arg Ala Ala Asp Glu Val Gly Cys Asp Arg Val Asp Glu Leu Thr 
2515 2520 2525 

ctg gcc gcc ccg etc gtc ctg ccc gag cac ggc ggc gta cag etc cag 7632 
Leu Ala Ala Pro Leu Val Leu Pro Glu His Gly Gly Val Gin Leu Gin 
2530 2535 2540 

ctg egg gtg ggc ccc gcc gac gcg tec ggc cgc cgc acc ctg ace gcc 7680 
Leu Arg Val Gly Pro Ala Asp Ala Ser Gly Arg Arg Thr Leu Thr Ala 
2545 2550 2555 2560 

cgc tec agg gcg gag ggc gac ggc gac cgc ccg tgg gtc cag cac gcc 7728 
Arg Ser Arg Ala Glu Gly Asp Gly Asp Arg Pro Trp Val Gin His Ala 
2565 2570 2575 

acc ggc gtc etc gcg gaa ggg gag teg acg ccc gaa ccc ggc tac gac 7 7 76 
Thr Gly Val Leu Ala Glu Gly Glu Ser Thr Pro Glu Pro Gly Tyr Asp 
2580 2585 2590 

ttc cac ace gag tec tgg ccg ccc gcc gac gcc gcg ccc gtc gaa ctg 7824 
Phe His Thr Glu Ser Trp Pro Pro Ala Asp Ala Ala Pro Val Glu Leu 
2595 2600 2605 

tec ggc etc tac ccg gac ttc gcc gca cac ggt ttc gac tac ggt ccc 7872 
Ser Gly Leu Tyr Pro Asp Phe Ala Ala His Gly Phe Asp Tyr Gly Pro 
2610 2615 2620 

cac ttc cag ggg ctg egg acc gcc tgg cgc ega ggc gac gag gtg ttc 792 0 
His Phe Gin Gly Leu Arg Thr Ala Trp Arg Arg Gly Asp Glu Val Phe 
2625 2630 2635 2640 

gcc gag gtc gcc ctg ccc gcc gag gcc gaa ggc gag gca tec gcg tac 7968 
Ala Glu Val Ala Leu Pro Ala Glu Ala Glu Gly Glu Ala Ser Ala Tyr 
2645 2650 2655 

gga etc cat ccg gcg ctg etc gac gcc gcc ctg cac gtc gtc gcg ttc 8016 
Gly Leu His Pro Ala Leu Leu Asp Ala Ala Leu His Val Val Ala Phe 
2660 2665 2670 

aac gga gtg gac cgc ggc gtc gtg ccg ttc tec tgg gag age gtc gcg 8064 
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Asn Gly Val Asp Arg Gly Val Val Pro Phe Ser Trp Glu Ser Val Ala 
2675 2680 2685 

ctg cac gcc acc ggc gcc teg gcc gta egg ate egg gtc gtc egg cae 8112 
Leu His Ala Thr Gly Ala Ser Ala Val Arg lie Arg Val Val Arg His 
2690 2695 2700 

sgc ggc gac acg gte tec gtg gat gtc gee gae ace ace ggc gag eee 8160 
Ser Gly Asp Thr Val Ser Val Asp Val Ala Asp Thr Thr Gly Glu Pro 
2705 2710 2715 2720 

gtc gcc tec ate ggc acg etc gte ctg egg geg gtc tec gcc gac cag 8208 
Val Ala Ser lie Gly Thr Leu Val Leu Arg Ala Val Ser Ala Asp Gin 
2725 2730 2735 

ttg gcg ggc ggc gcg gac ccg gcc gtc cgc gat geg ctg ttc cgc gtg 8256 
Leu Ala Gly Gly Ala Asp Pro Ala Val Arg Asp Ala Leu Phe Arg Val 
2740 2745 2750 

cag tgg aac eee gta cgc ctg eee ccg gcc ggg gee gcg gtg acc gtg 83 04 
Gin Trp Asn Pro Val Arg Leu Pro Pro Ala Gly Ala Ala Val Thr Val 
2755 2760 2765 

gcg acg etc ggc tec ett gee ggc gea ccg ttc gac ggc tac ccg gac 8352 
Ala Thr Leu Gly Ser Leu Ala Gly Ala Pro Phe Asp Gly Tyr Pro Asp 
2770 2775 2780 

ctg gcg tec ctg gcc egg tec ggt cgt gtg geg ggt gcg gtg ctg gta 84 00 
Leu Ala Ser Leu Ala Arg Ser Gly Arg Val Ala Gly Ala Val Leu Val 
2785 2790 2795 2800 

ccg gtg gaa gcc ggt gcc ggc gag gtg gtg gcg gac gat gtc gtg ggg 844 8 
Pro Val Glu Ala Gly Ala Gly Glu Val Val Ala Asp Asp Val Val Gly 
2805 2810 2815 

gcg acg cac gca acg gcc gcc egg gcg ctg gac ctg gcc egg teg tgg 8496 
Ala Thr His Ala Thr Ala Ala Arg Ala Leu Asp Leu Ala Arg Ser Trp 
2820 2825 2830 

ctg gcc gat gac egg ttc gcg gcc teg cgc ctg gtg ttc gtg acg cgt 8544 
Leu Ala Asp Asp Arg Phe Ala Ala Ser Arg Leu Val Phe Val Thr Arg 
2835 2840 2845 

ggc geg gtg tec ggt geg gat etc geg ggt geg gcg gtg tgg ggt ctg 8592 
Gly Ala Val Ser Gly Ala Asp Leu Ala Gly Ala Ala Val Trp Gly Leu 
2850 2855 2860 

gtg egg teg gcg ctg teg gag cac ccg ggc cgc ttc ggt ctg gtg gat 8640 
Val Arg Ser Ala Leu Ser Glu His Pro Gly Arg Phe Gly Leu Val Asp 
2865 2870 2875 2880 

ctg gat gac gat gcc gaa ctg gcg ctg gtg cea egg gtg ttg gcg teg 8688 
Leu Asp Asp Asp Ala Glu Leu Ala Leu Val Pro Arg Val Leu Ala Ser 
2885 2890 2895 

gat gag ccg cag ctg ctg gtg cgc ggt ggt gag gtg ctg gcg gcg egg 8736 
Asp Glu Pro Gin Leu Leu Val Arg Gly Gly Glu Val Leu Ala Ala Arg 
2900 2905 2910 

ctg gcc egg geg cag tec teg cac gcg gtg acc tgg gat ccg tec ggc 8784 
Leu Ala Arg Ala Gin Ser Ser His Ala Val Thr Trp Asp Pro Ser Gly 
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2915 2920 2925 

acg gtg etc gtc acc ggt ggc acg ggt ggt ctg ggc cgt gtg atg gca 8832 
Thr Val Leu Val Thr Gly Gly Thr Gly Gly Leu Gly Arg Val Met Ala 
2930 2935 2940 

cgt cac ttg gtg gtg gaa cac ggg gta egg aac ctg ctg ctg gtc age 8880 
Arg His Leu Val Val Glu His Gly Val Arg Asn Leu Leu Leu Val Ser 
2945 2950 2955 2960 

cgc cgt ggg ccc gcc gcc gaa ggt gcc gaa gag ctg gtg acg gag etc 8928 
Arg Arg Gly Pro Ala Ala Glu Gly TLLa Glu Glu Leu Val Thr Glu Leu 
2965 2970 ' 2975 

egg cac age ggt gcc gaa gtg gcc gtc gaa gcc tgt gat gtc acc gac 8976 
Arg His Ser Gly Ala Glu Val Ala Val Glu Ala Cys Asp Val Thr Asp 
2980 2985 2990 

geg gcc gee gtg gcc gac ctg gtg gee egg cac egg ate age get gtg 9024 
Ala Ala Ala Val Ala Asp Leu Val Ala Arg His Arg lie Ser Ala Val 
2995 3000 3005 

gtg cat acg gcc ggt gtc ctg gat gac ggt gtg gtg gag teg ctg aca 9072 
Val His Thr Ala Gly Val Leu Asp Asp Gly Val Val Glu Ser Leu Thr 
3010 3015 3020 

ccg gag egg ctg teg geg gtg ttg cgt ccg aag gtg gat gcg gcc tgg 912 0 
Pro Glu Arg Leu Ser Ala Val Leu Arg Pro Lys Val Asp Ala Ala Trp 
3025 3030 3035 3040 

aac ctg cac gag gcg acc agg gat ctg gac ctg gac gcg ttc gtg gtc 9168 
Asn Leu His Glu Ala Thr Arg Asp Leu Asp Leu Asp Ala Phe Val Val 
3045 3050 3055 

ttc tec tea gtg gca ggc acg ate ggg age ccc ggt eag gee aac tac 9216 
Phe Ser Ser Val Ala Gly Thr lie Gly Ser Pro Gly Gin Ala Asn Tyr 
3060 3065 3070 

gcg gcg ggc aac gcc ttc ctg gat gcc ctg gcc cac cac cgt egg gcg 9264 
Ala Ala Gly Asn Ala Phe Leu Asp Ala Leu Ala His His Arg Arg Ala 
3075 3080 3085 

9^S ggt ctt ccg gcg gcg teg ctg gca tgg ggc ccc tgg tec egg gac 9312 
Ala Gly Leu Pro Ala Ala Ser Leu Ala Trp Gly Pro Trp Ser Arg Asp 
3090 3095 3100 

99^ 99^ «itg acc ggc acc ctg ace gac gtc gac tec age gca teg ccc 9360 
Gly Gly Met Thr Gly Thr Leu Thr Asp Val Asp Ser Ser Ala Ser Pro 
3105 3110 3115 3120 

ggc agg cat gee ega act cac ccc cgc aca ggg cgt ggc etc ttc gac 94 08 
Gly Arg His Ala Arg Thr His Pro Arg Thr Gly Arg Gly I^eu Phe Asp 
3125 3130 3135 

gcc gcg ctg gcg gcc ggt gac gcc cac ctg etc ccc gta cgc ttc gac 9456 
Ala Ala Leu Ala Ala Gly Asp Ala His Leu Leu Pro Val Arg Phe Asp 
3140 3145 3150 

tg9 gcg tec ctg ege gee cag ggc gag gtg cca eeg ctg ttg cgc ggc 9504 
Trp Ala Ser Leu Arg Ala Gin Gly Glu Val Pro Pro Leu Leu Arg Gly 
3155 3160 3165 
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ctg ate agg acc cgt gcc egg cgc teg gcg gtc ggc ggc teg gcc gcg 9552 
Leu lie Arg Thr Arg Ala Arg Arg Ser Ala Val Gly Gly Ser Ala Ala 
3170 3175 3180 

gca gcc ggc ctg gtg gga. cgc ctg age gga egg gga acg gtg gag egg 9600 
Ala Ala Gly Leu Val Gly Arg Leu Ser Gly Arg Gly Thr Val Glu Arg 
3185 3190 3195 3200 

cgc gag gtg etc ctg gac ctg gta egg gcc cag ate gcg gtc gtc ctg 9648 
Arg Glu Val Leu Leu Asp Leu Val Arg Ala Gin lie Ala Val Val Leu 
3205 3210 3215 

ggc cac gcg aac ccg gag acg ate gag tec aee cgt gtc ttc cag gac 9696 
Gly His Ala Asn Pro Glu Thr lie Glu Ser Thr Arg Val Phe Gin Asp 
3220 3225 3230 

etc ggc tte gac tec ctg ace gcg gte gaa etc cgc aac cgc etc aac 9744 
Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Asn 
3235 3240 3245 

aac gcg acc ggc ctg cgc ctt teg gcc acc gcc gtc ttc gac tac ecc 9792 
Asn Ala Thr Gly Leu TUrg Leu Ser Ala Thr Ala Val Phe Asp Tyr Pro 
3250 3255 3260 

acg gcg gac gcg etc gte gac ttc ctg ctg gac gag ctg tte ggc gcg 984 0 
Thr Ala Asp Ala Leu Val Asp Phe Leu Leu Asp Glu Leu Phe Gly Ala 
3265 3270 3275 3280 

cag gag gag gee gag ctg ccg gcg ccg gtg ccg tea ccg gcg ggg gcc 9888 
Gin Glu Glu Ala Glu Leu Pro Ala Pro Val Pro Ser Pro Ala Gly Ala 
3285 3290 3295 

gcc gac gac ccg gtc gtg ate gtc ggc atg age tgc cgc tac ccg ggc 9936 
Ala Asp Asp Pro Val Val lie Val Gly Met Ser Cys Arg Tyr Pro Gly 
3300 3305 3310 

ggc gtc ggc teg ccc gag gac ctg tgg cgc ctg gtg teg gag ggc gtg 9984 
Gly Val Gly Ser Pro Glu Asp Leu Trp Arg Leu Val Ser Glu Gly Val 
3315 3320 3325 

gac gcg gtg tec gac ttc ccc acc gac cgt gga tgg gac gtg gag age 10032 
Asp Ala Val Ser Asp Phe Pro Thr Asp Arg Gly Trp Asp Val Glu Ser 
3330 3335 3340 

etc tac age ccc gac ccc gag gcg etc ggc acc teg tac acc cgc tee 10080 
Leu Tyr Ser Pro Asp Pro Glu Ala Leu Gly Thr Ser Tyr Thr Arg Ser 
3345 3350 3355 3360 

ggt gga ttc etc cac gag gcg gcg gag ttc gac ccc gat ttc ttc ggg 10128 
Gly Gly Phe Leu His Glu Ala Ala Glu Phe Asp Pro Asp Phe Phe Gly 
3365 3370 3375 

atg age ccg cgc gag gcg ctg gcg acc gac gee cag cag egg ctg ctg 10176 
Met Ser Pro Arg Glu Ala Leu Ala Thr Asp Ala Gin Gin Arg Leu Leu 
3380 3385 3390 

Ctg gag acg acc tgg gag gee ate gag cgc acg ggc ate gac ccg gcg 10224 
Leu Glu Thr Thr Trp Glu Ala lie Glu Arg Thr Gly lie Asp Pro Ala 
3395 3400 3405 
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ccg ctg egg ggc age cgt acg ggc gtc ttc gcg ggc gtg atg tac acc 10272 
Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Val Met Txt Thr 
3410 3415 3420 

gac tac ggc gac etc etc gtc ggc gae cag ttc gag ggc tac cgc age 10320 
Asp Tyr Gly Asp Leu Leu Val Gly Asp Gin Phe Glu Gly Tyr Arg Ser 
3425 3430 3435 3440 

aac ggc age gcg gee age ate gcc tec ggc egg gtc teg tac acc ttc 10368 
Asn Gly Ser Ala Ala Ser lie Ala Ser Gly Arg Val Ser Tyr Thr Phe 
3445 3450 3455 

ggt ttc gag ggt ccg gcg gtc acg gtg gac acg gca tgc teg teg tec 10416 
Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser 
3460 3465 3470 

Ctg gtc gee ctg cae tgg gcg gcg cag teg ctg cgc teg ggc gag tgc 10464 
Leu Val Ala Leu His Trp Ala Ala Gin Ser Leu Arg Ser Gly Glu Cys 
3475 3480 3485 

teg etc gcg gtc gcg ggc ggt gtg acg gtg atg tec aca ccg acg acg 10512 
Ser Leu Ala Val Ala Gly Gly Val Thr Val Met Ser Thr Pro Thr Thr 
3490 3495 3500 

ttc gtc gag ttc teg egg caa cgc gga ctg teg gcg gae ggc cgc tgc 10560 
Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ser Ala Asp Gly Arg Cys 
3505 3510 3515 3520 

aag gcg ttc gcc gat gcg gcc gac ggc gtc ggc tgg ggc gag ggc gtc 10608 
Lys Ala Phe Ala Asp Ala Ala Asp Gly Val Gly Trp Gly Glu Gly Val 
3525 3530 3535 

ggc atg etc gta ctg gag cgt ctg teg gac gcg cgc cgc aac ggg cac 10656 
Gly Met Leu Val Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His 
3540 3545 3550 

egg gtg etc gcg gtg gtg cgc ggc agt gcg gtg aac cag gac ggt gcg 10704 
Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala 
3555 3560 3565 

tec aat ggt ctg acg gcg ccg aac ggc ccc gcc cag cag egg gtg ate 10752 
Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ala Gin Gin Arg Val lie 
3570 3575 3580 

egg cag gcg ctg gcg agt gcg ggg ctg teg gcg gcg gat gtg gae gcg 10800 
Arg Gin Ala Leu Ala Ser Ala Gly Leu Ser Ala Ala Asp Val Asp Ala 
3585 3590 3595 3600 

gtg gag gcg cac ggt acg ggt acg acg ctg ggc gat ccg ate gag gcc 10848 
Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro lie Glu Ala 
3605 3610 3615 

cag gcg ctg etc gcc acg tat ggc cag gag cga ect gag gac egg ccg 10896 
Gin Ala Leu Leu Ala Thr Tyr Gly Gin Glu Arg Pro Glu Asp Arg Pro 
3620 3625 3630 

ttg ctg ctg ggg teg gtc aaa tee aac ate ggt cat gcg cag gcg get 10944 
Leu Leu Leu Gly Ser Val Lys Ser Asn lie Gly His Ala Gin Ala Ala 
3635 3640 3645 

teg ggt gtg gcg ggt gtc ate aag acg gtg ctg gcg atg egg cac ggt 10992 
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Ser Gly Val Ala Giy Val lie Lys Met Val Leu Ala Met Arg His Gly 
3650 3655 3660 

gtg ctg cct egg acg ctg cat gtg gat gaa ccg teg teg cat gtc gae 11040 
Val Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Ser His Val Asp 
3665 3670 3675 3680 

tgg agt gee ggt gee gtc gag ctg ctg aec tec gag gee gag tgg ccg 11088 
Trp Ser Ala Gly Ala Val Glu Leu Leu Thr Ser Glu Ala Glu Trp Pro 
3685 3690 3695 

cag ggc gag ggg ccg cgc cgc gcg ggc gtc tec tec ttc ggc gtc age 11136 
Gin Gly Glu Gly Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Val Ser 
3700 3705 3710 

ggg acg aac gcg eat gtg ate ctg gag cag cec gga ccg gae gcg gee 11184 
Gly Thr Asn Ala His Val lie Leu Glu Gin Pro Gly Pro Asp Ala Ala 
3715 3720 3725 

gae gee gea ccg gae gcc acg gtg acc gat cec ggc gcg ctg gca tgg 11232 
Asp Ala Ala Pro Asp Ala Thr Val Thr Asp Pro Gly Ala Leu Ala Trp 
3730 3735 3740 

gtg etc tee gca egg aac gaa gcg gcc ctg cgc tgc cag gcg gcg cgc 11280 
Val Leu Ser Ala Arg Asn Glu Ala Ala Leu Arg Cys Gin Ala Ala Arg 
3745 3750 3755 3760 

ctg ctg tec ctg gtc gcc ggc agt gae gcg ctg tgc gcg egg gae ate 11328 
Leu Leu Ser Leu Val Ala Gly Ser Asp Ala Leu Cys Ala Arg Asp lie 
3765 3770 3775 

ggc cac teg ctg gtg acc ggg egg teg age ttc gcc eac cgt gcg gtg 11376 
Gly His Ser Leu Val Thr Gly Arg Ser Ser Phe Ala His Arg Ala Val 
3780 3785 3790 

gtg tgg ggc cag gae cgc gae gca ctg gtg cgt gcc ctg tec gca etc 11424 
Val Trp Gly Gin Asp Arg Asp Ala Leu Val Arg Ala Leu Ser Ala Leu 
3795 3800 3805 

gcg gtg ggc gag gee gae gcc ggt ctg gcg gag ggc gcg tec ggc gcg 114 72 
Ala Val Gly Glu Ala Asp Ala Gly Leu Ala Glu Gly Ala Ser Gly Ala 
3810 3815 3820 

ggg agg acg gcc ttc ctg ttc teg ggc cag gga tea caa egg ctg gga 11520 
Gly Arg Thr Ala Phe Leu Phe Ser Gly Gin Gly Ser Gin Arg Leu Gly 
3825 3830 3835 3840 

atg gga tgg gag ttg tac get cgc tac ccg gtg ttc gcg gae gca ttc 11568 
Met Gly Trp Glu Leu Tyr Ala Arg Tyr Pro Val Phe Ala Asp Ala Phe 
3845 3850 3855 

gae gcc gtg tgc gcg gcc ttg gae gag eac ctg gag cgc cec ctg egg 11616 
Asp Ala Val Cys Ala Ala Leu Asp Glu His Leu Glu Arg Pro Leu Arg 
3860 3865 3870 

gae gtg gtc tgg ggc gag gae gcg gag ctg ctg aac cag acc gcg tac 11664 
Asp Val Val Trp Gly Glu Asp Ala Glu Leu Leu Asn Gin Thr Ala Tyr 
3875 3880 3885 

gee cag gee ggg ctg ttc gcg ate gag gtg gcg ctg tac egg ctg gcg 11712 
Ala Gin Ala Gly Leu Phe Ala lie Glu Val Ala Leu Tyr Arg Leu Ala 
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3890 3895 3900 

gaa teg tgg ggc atg cgc ccg gac ttc gtg gcg ggg cat teg ate ggt 1176 0 
Glu Ser Trp Gly Met Arg Pro Asp Phe Val Ala Gly His Ser lie Gly 
3905 3910 3915 3920 

gag gtc gee gcg gcc cat gtg teg ggt gtc ttc teg etc ccg gat gee 11808 
Glu Val Ala Ala Ala His Val Ser Gly Val Phe Ser Leu Pro Asp Ala 
3925 3930 3935 

tgt gcg ctg gtg gcg gcc cga ggc cga ctg atg cag caa ctg ccc tec 11856 
Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gin Gin Leu Pro Ser 
3940 3945 3950 

ggc ggc gcg atg atg gcg ate egg gcg ace gag gac gag gtc ett ccg 11904 
Gly Gly Ala Met Met Ala He Arg Ala Thr Glu Asp Glu Val Leu Pro 
3955 3960 3965 

cat ctg gcg gaa ggc gtc teg etc gcg gcg gtc aat ggc ccg teg teg 11952 
His Leu Ala Glu Gly Val Ser Leu Ala Ala Val Asn Gly Pro Ser Ser 
3970 3975 3980 

gtc gtg ate teg ggc gee gag gac gcg gtg ctg gee ate gcg gcg cae 12000 
Val Val He Ser Gly Ala Glu Asp Ala Val Leu Ala He Ala Ala His 
3985 3990 3995 4000 

ttc gcg ggg gag ggg cgc aaa acc ace cga ctg egg gtc teg cat gee 1204 8 
Phe Ala Gly Glu Gly Arg Lys Thr Thr Arg Leu Arg Val Ser His Ala 
4005 4010 4015 

tte eae teg ccg etc atg gaa ccg atg ctg gag gaa ttc cgc gcg gtg 12096 
Phe His Ser Pro Leu Met Glu Pro Met Leu Glu Glu Phe Arg Ala Val 
4020 4025 4030 

gtg aca egg ctg tee ttc ggc aeg ccg acg ate ccc gtc gtc tec aac 12144 
Val Thr Arg Leu Ser Phe Gly Thr Pro Thr He Pro Val Val Ser Asn 
4035 4040 4045 

ctg acg ggc cgc etc gcc gaa ccc gaa cag etc gcg cae gcc gac tac 12192 
Leu Thr Gly Arg Leu Ala Glu Pro Glu Gin Leu Ala His Ala Asp Tyr 
4050 4055 4060 

tgg gtc egg cac gtc cgc gag gea gtg cgc tte gcg gac ggg ata cag 1224 0 
Trp Val Arg His Val Arg Glu Ala Val Arg Phe Ala Asp Gly He Gin 
4065 4070 4075 4080 

gcg ctg egg gcg gaa ggg gtg aeg egg tte ctg gag etc ggc ccg gac 12288 
Ala Leu Arg Ala Glu Gly Val Thr Arg Phe Leu Glu Leu Gly Pro Asp 
4065 4090 4095 

ggt gtg ctg teg gcg atg gee cgc gag teg gea teg gac gac gcc gtg 12336 
Gly Val Leu Ser Ala Met Ala Arg Glu Ser Ala Ser Asp Asp Ala Val 
4100 4105 4110 

etc gcg ccc gta ctg cgc agg gac egg ccc gag gag acg gcg ctg ctg 123 84 
Leu Ala Pro Val Leu Arg Arg Asp Arg Pro Glu Glu Thr Ala Leu Leu 
4115 4120 4125 

ggc gee ctg gcg cag ctg tac gtc egg ggt gcg cae gtg gac tgg aeg 12432 
Gly Ala Leu Ala Gin Leu Tyr Val Arg Gly Ala His Val Asp Trp Thr 
4130 4135 4140 
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gtg ccg ttc gcc ggt teg ggt gcg cgc tgg gcg gat ctg ccg acg tac 12480 

Val Pro Phe Ala Gly Ser Gly Ala Arg Trp Ala Asp Leu Pro Thr Tyr 

4145 4150 4155 4160 

gcg ttc cag cac gag egg ttc tgg ccg teg gge ggt gtg gea cgt ccg 1252 8 

Ala Phe Gin Kis Glu Arg Phe Trp Pro Ser Gly Gly Val Ala Arg Pro 

4165 4170 4175 

gge gat gtg egg tec gcg ggc ctg gge teg gee ggg cat ccg ctg ctg 12576 

Gly Asp Val Arg Ser Ala Gly Leu Gly Ser Ala Gly His Pro Leu Leu 

4180 4185 4190 

ggc gcg gcg gtg gaa ctg gcg ggc teg ggc ggc ctg ttg ttc acg ggc 12624 

Gly Ala Ala Val Glu Leu Ala Gly Ser Gly Gly Leu Leu Phe Thr Gly 

4195 4200 4205 

egg ctg teg gtg tec teg cac ccg tgg ctg gcg gac cat gtg gtg ctg 12672 

Arg Leu Ser Val Ser Ser His Pro Trp Leu Ala Asp His Val Val Leu 

4210 4215 4220 

ggc tec gtc etc gtg ccc gge ace gcg ctg gtg gaa ctg gtg ctg egg 1272 0 

Gly Ser Val Leu Val Pro Gly Thr Ala Leu Val Glu Leu Val Leu Arg 

4225 4230 4235 4240 

gcg gcc gac gag gcc ggc tgt gac etc ctg gag gag ctg acg etc gcc 12768 
Ala Ala Asp Glu Ala Gly Cys Asp Leu Leu Glu Glu Leu Thr Leu Ala 

4245 4250 4255 

gea ccg ctg gtg ctg ccc gcc teg ggc gcc gcg gtc cag gtt cag gta 12816 

Ala Pro Leu Val Leu Pro Ala Ser Gly Ala Ala Val Gin Val Gin Val 

4260 4265 4270 

gcg gtg ggc gag ccc gat gag gcg ggc cgc egg ccg gtc teg gtc cat 12864 

Ala Val Gly Glu Pro Asp Glu Ala Gly Arg Arg Pro Val Ser Val His 

4275 4280 4285 

gea cgt gag ggc gag ggc eca tgg acg ctg cac gcc agt ggt gcg gtg 12912 

Ala Arg Glu Gly Glu Gly Pro Trp Thr Leu His Ala Ser Gly Ala Val 

4290 4295 4300 

acc teg ggc gcc gaa gtg ccc ccc ttc gac gcc ace gta tgg ccg ccc 12960 

Thr Ser Gly Ala Glu Val Pro Pro Phe Asp Ala Thr Val Trp Pro Pro 

4305 4310 4315 4320 

Srgc gcg gag ccc gtg gac gtg gcg gac tgc tac gac gta etc gee 13008 

Lys Gly Ala Glu Pro Val Asp Val Ala Asp Cys Tyr Asp Val Leu Ala 

4325 4330 4335 

gat gee ggg etc acc tac ggc ccg gee ttc eae ggc ctg caa gcg gee 13056 

Asp Ala Gly Leu Thr Tyr Gly Pro Ala Phe His Gly Leu Gin Ala Ala 

4340 4345 4350 

tgg aag etc ggt ggg gac gtc tac gee gag gcg aag etc ccc gag age 13104 

Trp Lys Leu Gly Gly Asp Val Tyr Ala Glu Ala Lys Leu Pro Glu Ser 

4355 4360 4365 

acc gac ggc gac gea tac ggt ctg cac ccc gcg etc ttc gac gcc gcg 13152 

Thr Asp Gly Asp Ala Tyr Gly Leu His Pro Ala Leu Phe Asp Ala Ala 

4370 4375 4380 
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ctg cac gcg teg gcg ctg ggc ggc gcg gaa gcg ggc gga gtc ccg ttc 
Leu His Ala Ser Ala Leu Gly Gly Ala Glu Ala Gly Gly Val Pro Phe 
4385 4390 4395 4400, 



13200 



tec tgg gcc gga gtg teg ctg cac gcg acc ggc gcc teg cac etc cgc 
Ser Trp Ala Gly Val Ser Leu His Ala Thr Gly Ala Ser His Leu Arg 
4405 4410 4415 



13248 



gtc cgc ate cgc gaa gcg ggc ggc gcg ctg teg gtc gcg ate gcg gae 
Val Arg lie Arg Glu Ala Gly Gly Ala Leu Ser Val Ala lie Ala Asp 
4420 4425 4430 



13296 



aeg tec ggc gcg ccg gtc gee teg gtg gag teg ctg gtg ata cgt ccg 
Thr Ser Gly Ala Pro Val Ala Ser Val Glu Ser Leu Val lie Arg Pro 
4435 4440 4445 



13344 



etc teg gcc ggg cag gtg cag gcc gcc gac cgt gac gcc etc ttc aag 
Leu Ser Ala Gly Gin Val Gin Ala Ala Asp Arg Asp Ala Leu Phe Lys 
4450 4455 4460 



13392 



gee gac tgg gtc cec gta eeg etc aeg gae gaa cgc gtc gag ccg ggc 
Ala Asp Trp Val Pro Val Pro Leu Thr Asp Glu Arg Val Glu Pro Gly 
4465 4470 4475 4480 



13440 



ace ggc ccg gag ggc gag ccg ctg egg aeg tac gcg gat ctg gat tec 
Thr Gly Pro Glu Gly Glu Pro Leu Arg Thr Tyr Ala Asp Leu Asp Ser 
4485 4490 4495 



13488 



Ctg gag ggc gcg gcc gtg ccc ggg aeg gtc ctg gtc gcg ccg ect tec 
Leu Glu Gly Ala Ala Val Pro Gly Thr Val Leu Val Ala Pro Pro Ser 
4500 4505 4510 



13536 



ggc get gee ggg aeg gtg gag tee gta cac gcc gcg acc gtc tgg gcg 
Gly Ala Ala Gly Thr Val Glu Ser Val His Ala Ala Thr Val Trp Ala 
4515 4520 4525 



13584 



ctg gag atg gtg cag gcg tgg ctg gee gac gac egg ttc gee ace teg 
Leu Glu Met Val Gin Ala Trp Leu Ala Asp Asp Arg Phe Ala Thr Ser 
4530 4535 4540 



13632 



ega ctg gtg ttc gtc acc cgc ggc gcg gcc ttc ggc gcg gat ett gcg 
Arg Leu Val Phe Val Thr Arg Gly Ala Ala Phe Gly Ala Asp Leu Ala 
4545 4550 4555 4560 



13680 



gcg gcc gcc gtc egg ggc ctg gtg cgc teg gca cag teg gag aac ccg 
Ala Ala Ala Val Arg Gly Leu Val Arg Ser Ala Gin Ser Glu Asn Pro 
4565 4570 4575 



13728 



ggc cgc ttc ggc ctg gtg gac atg gac ggc gac gee gat aeg acc gta 
Gly Arg Phe Gly Leu Val Asp Met Asp Gly Asp Ala Asp Thr Thr Val 
4580 4585 4590 



13776 



ccg gcg caa gcg etc gcg acc gac gag ccc gaa ctg ctg gtg cgt ggt 
Pro Ala Gin Ala Leu Ala Thr Asp Glu Pro Glu Leu Leu Val Arg Gly 
4595 4600 4605 



13824 



ggt gag gtg ctg gcg gcc egg ctg gtc egg gcg cag tec teg cac aeg 
Gly Glu Val Leu Ala Ala Arg Leu Val Arg Ala Gin Ser Ser His Thr 
4610 4615 4620 



13872 



gtg aeg tgg gat ccg tec ggt aeg gtc ccg ate ace ggc ggg acc ggt 13920 
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Val Thr Trp Asp Pro Ser Gly Thr Val Leu lie Thr Gly Gly Thr Gly 
4625 4630 4635 ' 4640 

ggg ctg ggc cgt agt gtc gcc egg cac ttg gtg age gag cac ggg gtg 13 96 8 
Gly Leu Gly Arg Ser Val Ala Arg His Leu Val Ser Glu His Gly Val 
4645 4650 4655 

cgc agt ctg ctg ctg gtc age cgc cgt ggt ccc gcg gcc gag ggt gcc 14016 
Arg Ser Leu Leu Leu Val Ser Arg Arg Gly Pro Ala Ala Glu Gly Ala 
4660 4665 4670 

ggg gag ttg gtg gcc gaa etc agg ggc agt ggc gcc gag gtg gtc ate 14064 
Gly Glu Leu Val Ala Glu Leu Arg Gly Ser Gly Ala Glu Val Val lie 
4675 4680 4685 

gag get tgt gat gtg ace gat gcg gtg gcg gtg gcc gat ctg gtg get 14112 
Glu Ala Cys Asp Val Thr Asp Ala Val Ala Val Ala Asp Leu Val Ala 
4690 4695 4700 

egg cat egg ate agt get gtg gtg eat aeg gee ggt gtt ctg gat gac 14160 
Arg His Arg lie Ser Ala Val Val His Thr Ala Gly Val Leu Asp Asp 
4705 4710 4715 4720 

ggt gtg gtg gag teg ctg aeg eeg gag egg ett gcg gtg gtg ttg cgt 14208 
Gly Val Val Glu Ser Leu Thr Pro Glu Arg Leu Ala Val Val Leu Arg 
4725 4730 4735 

eeg aag gtg gat gcg gcc tgg aac ctg cac gag gcg ace agg ggt ctg 14256 
Pro Lys Val Asp Ala Ala Trp Asn Leu His Glu Ala Thr Arg Gly Leu 
4740 4745 4750 

gat ctg gat gcg ttt gtg gtg ttc teg tec gtg gca ggc act ttc ggc 14304 
Asp Leu Asp Ala Phe Val Val Phe Ser Ser Val Ala Gly Thr Phe Gly 
4755 4760 4765 

agt gcg ggt cag gee aat tac gcg gcg ggt aat get ttc ctg gac gcg 14 352 
Ser Ala Gly Gin Ala Asn Tyr Ala Ala Gly Asn Ala Phe Leu Asp Ala 
4770 4775 4780 

ctg gcg tat cac cgt egg gcg gtg ggt ctg eeg gcg gtg teg ctg gcg 14400 
Leu Ala Tyr His Arg Arg Ala Val Gly Leu Pro Ala Val Ser Leu Ala 
4785 4790 4795 4800 

^99 99^ cet tgg teg cag gac ggt ggt atg acc ggc ace ttg age gac 14448 
Trp Gly Pro Trp Ser Gin Asp Gly Gly Met Thr Gly Thr Leu Ser Asp 
4805 4810 4815 

gcc gat gtc cag cgc ate gcc egg cag ggc atg eeg eeg ctg acc gtc 144 96 
Ala Asp Val Gin Arg lie Ala Arg Gin Gly Met Pro Pro Leu Thr Val 
4820 4825 4830 

gag gag ggt ctg gee etc ttc gac gcc gcg etc ggc age gee gaa ccc 14 54 4 
Glu Glu Gly Leu Ala Leu Phe Asp Ala Ala Leu Gly Ser Ala Glu Pro 
4835 4840 4845 

atg gca etc ccg gtc cgc ctg gac ctg gcg gcg ctg egg gca caa ggc 14592 
Met Ala Leu Pro Val Arg Leu Asp Leu Ala Ala Leu Arg Ala Gin Gly 
4850 4855 4860 

gag ccc cag cea ctg ctg cgc ggc etc ate egg aeg agg acc cgc egg 1464 0 
Glu Pro Gin Pro l^u Leu Arg Gly Leu lie Arg Thr Arg Thr Arg Arg 
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4880 



tec ggc gcc gcc gcg gca tec ggc ate gcg eag cge ctt gee ggg etg 
Ser Gly Ala Ala Ala Ala Ser Gly lie Ala Gin Arg Leu Ala Gly Leu 
4885 4890 4895 



14688 



tec acg gcg gag egg cgc gag geg etg etc gat gtc gta egg gcc eag 
Ser Thr Ala Glu Arg Arg Glu Ala Leu Leu Asp Val Val Arg Ala Gin 
4900 4905 4910 



14736 



ate geg aeg gtc etg ggc cae gee ggc eeg gaa aeg ate gee cct gae 
He Ala Thr Val Leu Gly His Ala Gly Pro Glu Thr He Ala Pro Asp 
4915 4920 4925 



14784 



egg gee ttc eag gae etc ggc etc gae tee etg aeg geg ate gaa etc 
Arg Ala Phe Gin Asp Leu Gly Leu Asp Ser Leu Thr Ala He Glu Leu 
4930 4935 4940 



14832 



egt aae etg etc ggc aag gee ace ggg etg egg etc ecg gca aeg ace 
Arg Asn Leu Leu Gly Lys Ala Thr Gly Leu Arg Leu Pro Ala Thr Thr 
4945 4950 4955 4960 



14880 



gtg ttc gae tac ccg acg gtg gat gcc etg gcc gcc cae etc ttg gae 
Val Phe Asp Tyr Pro Thr Val Asp l^la Leu Ala Ala His Leu Leu Asp 
4965 4970 4975 



14928 



gaa etg ttc ggc gcg gag aeg ggg acc gcg acg gag acg cce etc ccg 
Glu Leu Phe Gly Ala Glu Thr Gly Thr Ala Thr Glu Thr Pro Leu Pro 
4980 4985 4990 



14976 



gtg cce ggc etg ccg tec etg geg gae gat ccg gtc gtg ate gtc ggc 
Val Pro Gly Leu Pro Ser Leu Ala Asp Asp Pro Val Val He Val Gly 
4995 5000 5005 



15024 



atg age tgc cgc ttc ecc ggc ggc gtc gee teg ccg gag gae etg tgg 
Met Ser Cys Arg Phe Pro Gly Gly Val Ala Ser Pro Glu Asp Leu Trp 
5010 5015 5020 



15072 



cgc etg gtg gcg gae ggc gtg gae gcc gtc tec gcc ttc ccg acc gae 
Arg Leu Val Ala Asp Gly Val Asp Ala Val Ser Ala Phe Pro Thr Asp 
5025 5030 5035 5040 



15120 



^SS 99^ t:gg gag ate gae gae ace tac gae cce gag egg gag ggc gcc 
Arg Gly Trp Glu He Asp Asp Thr Tyr Asp Pro Glu Arg Glu Gly Ala 
5045 5050 5055 



15168 



ate gee acc egt tec ggt gga ttc etc cae gae gcg gcg gag ttc gae 
He Ala Thr Arg Ser Gly Gly Phe Leu His Asp Ala Ala Glu Phe Asp 
5060 5065 5070 



15216 



cce gag ttc ttc ggg atg age ccg cge gag gcc etg acc acc gae gcc 
Pro Glu Phe Phe Gly Met Ser Pro Arg Glu Ala Leu Thr Thr Asp Ala 
5075 5080 5085 



15264 



eag eag egg etg ttg etg gag acg ace tgg gag geg etg gag cgc gee 
Gin Gin Arg Leu Leu Leu Glu Thr Thr Trp Glu Ala Leu Glu Arg Ala 
5090 5095 5100 



15312 



ggt atg gae eeg gcc aeg etc cgc ggc age cge acg ggt gtc ttc gcc 
Gly Met Asp Pro Ala Thr Leu Arg Gly Ser Arg Thr Gly Val Phe Ala 
5105 5110 5115 5120 



15360 
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gsrc gtg atg tac cac gac tac teg acg ctg etc tee ggg egc gag tte 15408 
Gly Val Met Tyr His Asp Tyr Ser Thr Leu Leu Ser Gly Arg Glu Phe 
5125 5130 * 5135 

gag ggc tac cag ggc age ggc age gea ggc agt gtg gee teg ggc egg 154 56 
Glu Gly Tyr Gin Gly Ser Gly Ser Ala Gly Ser Val Ala Ser Gly Arg 
5140 5145 5150 

gtc teg tac ace tte ggt tte gag ggt ccg gcg gtc acg gtg gac acg 15504 
Val Ser Tyr Thr Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr 
5155 5160 5165 

geg tgc teg teg tec ctg gtc gee ctg cac ctg gca gea cag teg ctg 15552 
Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Ala Gin Ser Leu 
5170 5175 5180 

cgc teg ggc gag tgc teg ctg gcg etc geg ggc ggt gtg acg gtg atg 15600 
Arg Ser Gly Glu Cys Ser Leu Ala Leu Ala Gly Gly Val Thr Val Met 
5185 5190 5195 5200 

tec aca ccg ctg acc tte gtg gag tte tec cgc cag ggc gga ctg teg 1564 8 
Ser Thr Pro Leu Thr Phe Val Glu Phe Ser Arg Gin Gly Gly Leu Ser 
5205 5210 5215 

gcg gac ggc cgc tgc aag gcg tte gee gat gcg gee gac ggc gtc ggc 156 96 
Ala Asp Gly Arg Cys Lys Ala Phe Ala Asp Ala Ala Asp Gly Val Gly 
5220 5225 5230 

tgg gee gaa ggc gee gga ate ctg gtg ctg gag egt ctg teg gac gee 15744 
Trp Ala Glu Gly Ala Gly lie Leu Val Leu Glu Arg Leu Ser Asp Ala 
5235 5240 5245 

cgc cgc aac ggg cac cgc ate etc geg acg gtg cgc ggc agt gcg gtg 157 92 
Arg Arg Asn Gly His Arg lie Leu Ala Thr Val Arg Gly Ser Ala Val 
5250 5255 5260 

aac cag gac ggt gcg tec aat ggt ctg acg gcg ccg aac ggt ecc gee 15840 
Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ala 
5265 5270 5275 5280 

cag cag egg gtg ate egg cag gcg ctg gcg agt gcg ggg ctg teg gcg 15888 
Gin Gin Arg Val lie Arg Gin Ala Leu Ala Ser Ala Gly Leu Ser Ala 
5285 5290 5295 

gcg gat gtg gac gcg gtg gag gcg cac ggt acg ggt acg acg ctg ggc 15936 
Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly 
5300 5305 5310 

gat ccg ate gag gee cag gcg ctg etc gcg acg tat ggc cag gag egg 15984 
Asp Pro lie Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin Glu Arg 
5315 5320 5325 

ccg gag gac egg ccg ttg ctg etc ggc tec gtg aag tec aac ate ggt 16032 
Pro Glu Asp Arg Pro Leu Leu Leu Gly Ser Val Lys Ser Asn lie Gly 
5330 5335 5340 

cac gcg caa gcg get teg ggt gtt gee ggt gtc ate aag atg gtg ctg 16080 
His Ala Gin Ala Ala Ser Gly Val Ala Gly Val lie Lys Met Val Leu 
5345 5350 5355 5360 
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gcg atg egg cac ggt gtg ctg cct egg acg ctg cat gtc gac gag ccg 
Ala Met Arg His Gly Val Leu Pro Arg Thr Leu His Val Asp Glu Pro 
5365 5370 5375 
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16128 



teg teg cat gte gac tgg age gee ggt gee gtc gag ctg ctg ace tee 
Ser Ser His Val Asp Trp Ser Ala Gly Ala Val Glu Leu Leu Thr Ser 
5380 5385 5390 



16176 



gag gee gag tgg ccg cag ggc gag ggg ccg cge cgc gcg ggc gtc tee 
Glu Ala Glu Trp Pro Gin Gly Glu Gly Pro Arg Arg Ala Gly Val Ser 
5395 5400 5405 



16224 



tec ttc ggc ate agt ggg acg aac gcg cat gtg ate ctg gag cag cec 
Ser Phe Gly lie Ser Gly Thr Asn Ala His Val lie Leu Glu Gin Pro 
5410 5415 5420 



16272 



gaa ccg gtc gcg gcg gaa acg gaa teg ate acg cec gac ace gea ccg 
Glu Pro Val Ala Ala Glu Thr Glu Ser lie Thr Pro Asp Thr Ala Pro 
5425 5430 5435 5440 



16320 



gac gee gee gag gac gag gcg gee gat tee ggg acg ccg gtg ccg gea 
Asp Ala Ala Glu Asp Glu Ala Ala Asp Ser Gly Thr Pro Val Pro Ala 
£>445 5450 5455 



16368 



Ctg ctg tee ggc agg age gea teg gcg ctg egg gee cag gea gea cga 
Leu Leu Ser Gly Arg Ser Ala Ser Ala Leu Arg Ala Gin Ala Ala Arg 
5460 5465 5470 



16416 



Ctg ctg tec cga etc gac ggc gat ccg ggg ccg egg ate act gac gtc 
Leu Leu Ser Arg Leu Asp Gly Asp Pro Gly Pro Arg He Thr Asp Val 
5475 5480 5485 



16464 



gee tac tee etc gcg ace ggc cgt teg gee ttc ccg cac cge gcg gtg 
Ala Tyr Ser Leu Ala Thr Gly Arg Ser Ala Phe Pro His Arg Ala Val 
5490 5495 5500 



16512 



ate etc gee gcg aac cga gcg gac ctg ctg cac teg ctg tec gee ctg 
He Leu Ala Ala Asn Arg Ala Asp Leu Leu His Ser Leu Ser Ala Leu 
5505 5510 5515 5520 



16560 



gcc gag ggc cac acc gag gcg ccg gee gta gtc gea cag gac cga gee 
Ala Glu Gly His Thr Glu Ala Pro Ala Val Val Ala Gin Asp Arg Ala 
5525 5530 5535 



16608 



cgc teg ggc aag ctg gcc ttc ctg ttc teg ggg cag gga teg caa cge 
Arg Ser Gly Lys Leu Ala Phe Leu Phe Ser Gly Gin Gly Ser Gin Arg 
5540 5545 5550 



16656 



ctg ggc atg gga egg gag ttg tac ggt cgc tac ccg gcg ttc gcc gag 
Leu Gly Met Gly Arg Glu Leu Tyr Gly Arg Tyr Pro Ala Phe Ala Glu 
5555 5560 5565 



16704 



gcc etc gac gcg gtg tgc gcc gcc ctg gac gcc cac ctg gac cgt cec 
Ala Leu Asp Ala Val Cys Ala Ala Leu Asp Ala His Leu Asp Arg Pro 
5570 5575 5580 



16752 



ctg egg gac gtc arc tgg ggc gag gac gcg gaa ctg ctg aac egg acc 
Leu Arg Asp Val He Trp Gly Glu Asp Ala Glu Leu Leu Asn Arg Thr 
5585 5590 5595 5600 



16800 



ggg tac gcc cag aca ggg ctg ttc gcc ate gag gtg gcc ctg tec cge 16 84 8 
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Gly Tyr Ala Gin Thr Gly Leu Phe Ala lie Glu Val Ala Leu Phe Arg 
5605 5610 5615 



ctg ctg gag teg tgg ggc gta cgc ccg gac cac ctg ctg ggg cac tec 
Leu Leu Glu Ser Trp Gly Val Arg Pro Asp His Leu Leu Gly His Ser 
5620 5625 5630 



16896 



ate gga gaa ate gee gcg gee eat gtg gcg gge gte ete tec etc ccg 
lie Gly Glu lie Ala Ala Ala His Val Ala Gly Val Leu Ser Leu Pro 
5635 5640 5645 



16944 



gac gcc tgt gcg ctg gtg gcg gcc cga ggt egg ctg atg cag caa ctg 
Asp Ala Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gin Gin Leu 
5650 5655 5660 



16992 



ccg tee ggc ggc gcg atg atg gcg ate egg gcg ace gag gac gag gte 
Pro Ser Gly Gly Ala Met Met Ala lie Arg Ala Thr Glu Asp Glu Val 
5665 5670 5675 5680 



17040 



ett eeg eat etg gcg gaa gge gte teg etc gcg gcg gte aac ggg ccg 
Leu Pro His Leu Ala Glu Gly Val Ser Leu Ala Ala VaJ- Asn Gly Pro 
5685 5690 5695 



17088 



teg teg gte gtg gte tec ggc gcc gag gac gag gta etc gcc etc gcg 
Ser Ser Val Val Val Ser Gly Ala Glu Asp Glu Val Leu Ala Leu Ala 
5700 5705 5710 



17136 



gcg cac ttc gag gaa gag gga cgc aag ace ace cga ctg egg. gte teg 
Ala His Phe Glu Glu Glu Gly Arg Lys Thr Thr Arg Leu Arg Val Ser 
5715 5720 5725 



17184 



cac gcc ttc cac tee eeg etc atg gaa ccg atg etg gee gac ttc egg 
His Ala Phe His Ser Pro Leu Met Glu Pro Met Leu Ala Asp Phe Arg 
5730 5735 5740 



17232 



gcc gte gcc gac ggc atg acc tac gcc gcg eeg cgc ate ecc gtg gte 
Ala Val Ala Asp Gly Met Thr Tyr Ala Ala Pro Arg lie Pro Val Val 
5745 5750 5755 5760 



17280 



tec aac gte acc ggc egg ccc gcc acc gcg gaa gag ctg tgc tge gcc 
Ser Asn Val Thr Gly Arg Pro Ala Thr Ala Glu Glu Leu Cys Cys Ala 
5765 5770 5775 



17328 



gag tac tgg gte ggc cac gta cgc gag gcc gta egg ttc gcc gac ggg 
Glu Tyr Trp Val Gly His Val Arg Glu Ala Val Arg Phe Ala Asp Gly 
5780 5785 5790 



17376 



gte gge gcg etc cgc gag cag ggt gtg aeg aeg ttc ctg gaa etc ggc 
Val Gly Ala Leu Arg Glu Gin Gly Val Thr Thr Phe Leu Glu Leu Gly 
5795 5800 5805 



17424 



ecc gac gge tet ete tec gcg ete gee gee gaa tec gee gcc gac gac 
Pro Asp Gly Ser Leu Ser Ala Leu Ala Ala Glu Ser Ala Ala Asp Asp 
5810 5815 5820 



17472 



tec gta ctg gcc ccc gta ctg cgc aag aac cgc ccc gag gea ccg gea 
Ser Val Leu Ala Pro Val Leu Arg Lys Asn Arg Pro Glu Ala Pro Ala 
5825 5830 5835 5840 



17520 



etg ete aeg gee etg gea cga etg cac gee cag gge aeg ccg gte gac 
Leu Leu Thr Ala Leu Ala Arg Leu His Ala Gin Gly Thr Pro Val Asp 



17568 
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5845 



5850 



5855 



tgg tec gcc gcc ttc gcc ggt acg ggt gcg egg tgg gtg gac ctg ccg 
Trp Ser Ala Ala Phe Ala Gly Thr Gly Ala Arg Trp Val Asp Leu Pro 
5860 5865 5870 



17616 



acg tac gca ttc cag cac gag egg ttc tgg ccg teg ggc ggg gcg gcg 
Thr Tyr Ala Phe Gin His Glu Arg Phe Trp Pro Ser Gly Gly Ala Ala 
5875 5880 5885 



17664 



cgc gca ggc gat gtg egg tee gcg ggc etg ggc teg gee ggg cac ccg 
Arg Ala Gly Asp Val Arg Ser Ala Gly Leu Gly Ser Ala Gly His Pro 
5890 5895 5900 



17712 



ctg etg ggt get gcg gtg gaa ctg gcg ggc tec ggc ggg egg ttg etc 
Leu Leu Gly Ala TQa Val Glu Leu Ala Gly Ser Gly Gly Tlrg Leu Leu 
5905 5910 5915 5920 



17760 



ace ggg egg ctg tec ctg tec teg cac ccg tgg etg gcg gat cac gtg 
Thr Gly Arg Leu Ser Leu Ser Ser His Pro Trp Leu Ala Asp His Val 
5925 5930 5935 



17808 



gtg etg ggc tee gta ctg gtg cec ggc acg gcg etc atg gaa etg gtg 
Val Leu Gly Ser Val Leu Val Pro Gly Thr Ala Leu Met Glu Leu Val 
5940 5945 5950 



17856 



ctg egg gcg gcc gac gag gtg gac tgc gcc gcg gtg gae gaa etc acg 
Leu Arg Ala Ala Asp Glu Val Asp Cys Ala Ala Val Asp Glu Leu Thr 
5955 5960 5965 



17904 



etc gee gcg cca etg gtc ctg cec gee teg ggc gee gcg ate cag gta 
Leu Ala Ala Pro Leu Val Leu Pro Ala Ser Gly Ala Ala lie Gin Val 
5970 5975 5980 



17952 



cag gta tgg gtg ggc gag cec gat gag gcg ggc cgc egg ccg gtc teg 
Gin Val Trp Val Gly Glu Pro Asp Glu Ala Gly Arg Arg Pro Val Ser 
5985 5990 5995 6000 



18000 



gtc cat gca cgc gag ggc gag ggc cca tgg acg ctg cac gee gac ggc 
Val His Ala Arg Glu Gly Glu Gly Pro Trp Thr Leu His Ala Asp Gly 
6005 6010 6015 



18048 



gcc ctg gee ccg gcg gee gag acg gtg ccg ttc gat ace gcg ata tgg 
Ala Leu Ala Pro Ala Ala Glu Thr Val Pro Phe Asp Thr Ala lie Trp 
6020 6025 6030 



18096 



cec ccg cag ggt gcc gag cac ctg gae gcg gcg ggc tgt tac gag egg 
Pro Pro Gin Gly Ala Glu His Leu Asp Ala Ala Gly Cys Tyr Glu Arg 
6035 6040 6045 



18144 



ttc gcg gac gee gga ttc gcg tac ggc ccg gtg ttc cag ggc etg egg 
Phe Ala Asp Ala Gly Phe Ala Tyr Gly Pro Val Phe Gin Gly Leu Arg 
6050 6055 6060 



18192 



gcg gcc tgg aag etc ggc gag gac ate tac gcc gag gtc gca etc cec 
Ala Ala Trp Lys Leu Gly Glu Asp He Tyr Ala Glu Val Ala Leu Pro 
6065 6070 6075 6080 



18240 



gaa ggc acg gae ggc aac gcc tac ggc ctg cac cec gca etc ttc gae 
Glu Gly Thr Asp Gly Asn Ala Tyr Gly Leu His Pro Ala Leu Phe Asp 
6085 6090 6095 



18288 
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gcc gcg ctg cac gca gcg etc ctg ggc ggc gag gga acg gac gaa gcc 
Ala Ala Leu His Ala Ala Leu Leu Gly Gly Glu Gly Thr Asp Glu Ala 
6100 6105 6110 



18336 



gcg gtc ccc ttc tec tgg aac ggg gtg acg etc cac gcc acc ggc get 
Ala Val Pro Phe Ser Trp Asn Gly Val Thr Leu His Ala Thr Gly Ala 
6115 6120 6125 



18384 



tec egg gtg agg gta egc ate cgt ccc ace gaa ggc ggt acg teg ata 
Ser Arg Val Arg Val Arg lie Arg Pro Thr Glu Gly Gly Thr Ser lie 
6130 6135 6140 



18432 



gcc etc gtg gac acc gcc ggt gcg eeg gtc gcc teg gtg ega tee ctg 
Ala Leu Val Asp Thr Ala Gly Ala Pro Val Ala Ser Val Arg Ser Leu 
6145 6150 6155 6160 



18480 



ace gca cgt eeg ate ace gcc ggg eag ttg cag ace ggt gac egc gat 
Thr Ala Arg Pro lie Thr Ala Gly Gin Leu Gin Thr Gly Asp Arg Asp 
6165 6170 6175 



18528 



tee ett ttc cag gtc gac tgg ace acc etc cac etc acg gac gag egc 
Ser Leu Phe Gin Val Asp Trp Thr Thr Leu His Leu Thr Asp Glu Arg 
6180 6185 6190 



18576 



gcg aac tec etc gcc ctg etc ggc aag gac acc gag ggc ate etc gac 
Ala Asn Ser Leu Ala Leu Leu Gly Lys Asp Thr Glu Gly lie Leu Asp 
6195 6200 6205 



18624 



aca etc tec etc eag ccc cac gcg gac etc gac gac etc gcg gcg acg 
Thr Leu Ser Leu Gin Pro His Ala Asp Leu Asp Asp Leu Ala Ala Thr 
6210 6215 6220 



18672 



ggc gtc cac gac acc gtg etc gcc eeg ctg ccc acc egg ace gcc gga 
Gly Val His Asp Thr Val Leu Ala Pro Leu Pro Thr Arg Thr Ala Gly 
6225 6230 6235 6240 



18720 



acg gtg gaa tec gtc cat gcc gcc acg aca ggg gca ctg gee ctg ate 
Thr Val Glu Ser Val His Ala Ala Thr Thr Gly Ala Leu Ala Leu lie 
6245 6250 6255 



18768 



egg tec tgg ctg gee gac gac egg ttc gcc gcc teg egc ctg gtg ttc 
Arg Ser Trp Leu Ala Asp Asp Arg Phe Ala Ala Ser Arg Leu Val Phe 
6260 6265 6270 



18816 



gtg acg cgt ggc gcg gtg tec ggc acg gat etc gcg ggt gcg teg gtg 
Val Thr Arg Gly Ala Val Ser Gly Thr Asp Leu Ala Gly Ala Ser Val 
6275 6280 6285 



18864 



tgg ggc ctg gtg egg teg gcg ttg ttg gag cac eeg ggc egc ttc ggt 
Trp Gly Leu Val Arg Ser Ala Leu Leu Glu His Pro Gly Arg Phe Gly 
6290 6295 6300 



18912 



ctg gtg gac gtg gac gtg gac caa gac get gaa gtg eeg ett gtg eca 
Leu Val Asp Val Asp Val Asp Gin Asp Ala Glu Val Pro Leu Val Pro 
6305 6310 6315 6320 



18960 



agg gcg ttg gcg teg gat gaa eeg eag gtg ttg gtg cgt ggt ggt gag 
Arg Ala Leu Ala Ser Asp Glu Pro Gin Val Leu Val Arg Gly Gly Glu 
6325 6330 6335 



19008 
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gtg ctg gcg gcc egg ctg gtc egg gcg cag tec teg gae aeg gtg aeg 19056 
Val Leu Ala Ala Arg Leu Val Arg Ala Gin Ser Ser Asp Thr Val Thr 
6340 6345 6350 

tgg gat ceg tee ggt aeg gtc ctg ate ace ggc ggg ace ggt ggg ctg 19104 
Trp Asp Pro Ser Gly Thr Val Leu lie Thr Gly Gly Thr Gly Gly Leu 
6355 6360 6365 

ggt cgt agt gtc gcc egg cac ttg gtg age gag cac ggg gtg cgc agt 19152 
Gly Arg Ser Val Ala Arg His Leu Val Ser Glu His Gly Val Arg Ser 
6370 6375 6380 

ctg ctg ctg gtc age cgc cgt ggt ccc gee gee gag ggt gtc gat gea 19200 
Leu Leu Leu Val Ser Arg Arg Gly Pro Ala Ala Glu Gly Val Asp Ala 
6385 6390 6395 6400 

etc gtt gcc gaa ctt gee gag tgc ggc gcg cag gtc ace gtc gag get 19248 
Leu Val Ala Glu Leu Ala Glu Cys Gly Ala Gin Val Thr Val Glu Ala 
6405 6410 6415 

tgt gat gtg act gae gcg gtg gcg gtg gcc gat ctg gtg get egg cat 19296 
Cys Asp Val Thr Asp Ala Val Ala Val Ala Asp Leu Val Ala Arg His 
6420 6425 6430 

egg ate agt get gtg gtg eat aeg gee ggt gtt ctg gat gae ggt gtg 19344 
Arg lie Ser Ala Val Val His Thr Ala Gly Val Leu Asp Asp Gly Val 
6435 6440 6445 

gtg gag teg ctg aeg ceg gag egg ctg teg gcg gtg ctg cgt ceg aag 19392 
Val Glu Ser Leu Thr Pro Glu Arg Leu Ser Ala Val Leu Arg Pro Lys 
6450 6455 6460 

gtg gat gcg gcc tgg aac ctg cac gag gcg ace agg ggt ctg gat ctg 1944 0 
Val Asp Ala Ala Trp Asn Leu His Glu Ala Thr Arg Gly Leu Asp Leu 
6465 6470 6475 6480 

gat gcg ttt gtg gtg tte teg tec gtg gea ggc acc ttc ggc agt gcg 19488 
Asp Ala Phe Val Val Phe Ser Ser Val Ala Gly Thr Phe Gly Ser Ala 
6485 6490 6495 

ggt cag gcc aat tac gcg gcg ggt aat get tte ctg gae gcg ctg gcg 19536 
Gly Gin Ala Asn Tyr Ala Ala Gly Asn Ala Phe Leu Asp Ala Leu Ala 
6500 6505 6510 

tat cac cgt egg gcg gtg ggt ttg ceg gcg gtg teg ctg gcg tgg ggc 19584 
Tyr His Arg Arg Ala Val Gly Leu Pro Ala Val Ser Leu Ala Trp Gly 
6515 6520 6525 

cet tgg teg cag gae ggt ggt atg acc ggc acc ttg age gae gcc gat 19632 
Pro Trp Ser Gin Asp Gly Gly Met Thr Gly Thr Leu Ser Asp Ala Asp 
6530 6535 6540 

gtc cag cgc ate gcc egg cag ggc atg ceg ceg ctg acc gtc gag gag 196 80 
Val Gin Arg lie Ala Arg Gin Gly Met Pro Pro Leu Thr Val Glu Glu 
6545 6550 6555 6560 

ggt ctg gee etc ttc gae gee gcg etc ggc age gee gaa ccc atg gea 19728 
Gly Leu Ala Leu Phe Asp Ala Ala Leu Gly Ser Ala Glu Pro Met Ala 
6565 6570 6575 

etc ccc gtc cgc ctg gae etc gcg gcc eta egg gea caa ggc gag ccc 19776 
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Leu Pro Val Arg Leu Asp Leu Ala Ala Leu Arg Ala Gin Gly Glu Pro 
6580 6585 6590 

cag cca ctg ctg cgc ggc etc ate egg ace ecg ggt cga cge aeg geg 19824 
Gin Pro Leu Leu Arg Gly Leu lie Arg Thr Pro Gly Arg Arg Thr Ala 
6595 6600 6605 

geg gee geg aeg gag ggc gac ace get gee gee ttc gee ggg cgc ctg 198 72 
Ala Ala Ala Thr Glu Gly Asp Thr Ala Ala Ala Phe Ala Gly Arg Leu 
6610 6615 6620 

ace ggg ctg teg geg gca gaa gga cgc gag gtc gta ctg ggc gee gta 19920 
Thr Gly Leu Ser Ala Ala Glu Gly Arg Glu Val Val Leu Gly Ala Val 
6625 6630 6635 6640 

cge age cag ate geg ggg gtc etc gga cac gee gaa gee aeg gaa ate 19968 
Arg Ser Gin lie Ala Gly Val Leu Gly His Ala Glu Ala Thr Glu lie 
6645 6650 6655 

gac cag gac cgc gee ttc ctg gac etc gga ttc gac tec etc ace geg 2 0016 
Asp Gin Asp Arg Ala Phe Leu Asp Leu Gly Phe Asp Ser Leu Thr Ala 
6660 6665 6670 

gtc gaa etc cgc aac cgc ctg ggc gee gtc ace gga ate cge ctg ecg 20064 
Val Glu Leu Arg Asn Arg Leu Gly Ala Val Thr Gly lie Arg Leu Pro 
6675 6680 6685 

geg ace ctg etc ttc gac tac ecg aeg ecg gca gaa etc gtc gee cac 20112 
Ala Thr Leu Leu Phe Asp Tyr Pro Thr Pro Ala Glu Leu Val Ala His 
6690 6695 6700 

etc cat gee egg ate gca ecg gag ecg ace gtc ggc ecg gag geg etc 2016 0 
Leu His Ala Arg lie Ala Pro Glu Pro Thr Val Gly Pro Glu Ala Leu 
6705 6710 6715 6720 

ctg ggc gaa etc gaa agg atg gag aag tee ttc ggc gga etc gac etc 20208 
Leu Gly Glu Leu Glu Arg Met Glu Lys Ser Phe Gly Gly Leu Asp Leu 
6725 6730 6735 

aeg gag gag atg cac gaa cag ata gee ggc cgt ctg gaa gtc etc egg 202 56 
Thr Glu Glu Met His Glu Gin lie Ala Gly Arg Leu Glu Val Leu Arg 
6740 6745 6750 

gee aag tgg gac gee ctg egg gac aeg gca geg gca gee ggg cac gac 20304 
Ala Lys Trp Asp Ala Leu Arg Asp Thr Ala Ala Ala Ala Gly His Asp 
6755 6760 6765 

ggt tec ecg tec gac gag gac ttc gac ttc gag tec gee tec gac gac 20352 
Gly Ser Pro Ser Asp Glu Asp Phe Asp Phe Glu Ser Ala Ser Asp Asp 
6770 6775 6780 

gag gtc ttc gac etc etc gac aac gaa etc ggc ctg tec tga 203 94 

Glu Val Phe Asp Leu Leu Asp Asn Glu Leu Gly Leu Ser 
6785 6790 6795 



<210> 4 

<211> 6798 

<212> PRT 

<213> Streptomyces natalensis 
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<400> 4 

Met Ser Asn Glu Glu Lys Leu Arg Glu Tyr Leu Lys Arg Ala lie Ala 
15 10 15 

Asp Leu His Glu Thr Arg Gin Gin Leu Asp Glu Thr Glu Ala Lys Gin 
20 25 30 

Arg Glu Pro Leu Ala lie Val Ser Met Ala Cys Arg Phe Pro Gly Gly 
35 40 45 

Val Arg Ser Pro Glu Glu Leu Trp Glu Leu Leu Arg Asp Gly Val Asp 
50 55 60 

Ala Val Ser Ser Phe Pro Arg Asn Arg Gly Trp Asp Leu Asp Ala Leu 
65 70 75 80 

Tyr His Ser Asp Pro Ala His Gin Gly Thr Ser Tyr Ala Arg Glu Gly 
85 90 95 

Gly Phe Leu His Asp Ala Gly Glu Phe Asp Pro Gly Phe Phe Gly lie 
100 105 110 

Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 

Glu Thr Ala Trp Glu Ala Val Glu Arg Ala Gly lie Asp Pro Glu Ser 
130 135 140 

Leu Ala Gly Ser Arg Thr Gly Val Phe Val Gly Thr Gly His Gly Gly 
145 150 155 160 

Tyr Asp Ala Glu Gly Arg Arg Arg Ala Asp Glu Val Gly Gly His Leu ■ 
165 170 175 

Leu Thr Gly Asn His lie Ser lie Ala Ser Gly Arg lie Ser Tyr Val 
180 185 190 

Leu Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser 
195 200 205 

Ser Leu Val Ala Leu His Leu Ala Met His Ala Leu Arg Arg Asp Glu 

210 215 220 

Cys Ala Met Ala Leu Val Gly Gly Ala Thr Val Met Ser Thr Pro Gin 
225 230 235 240 

Met Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg 
245 250 255 

Cys Lys Pro Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Ser Glu Gly 
260 265 270 

Val Gly Leu Leu Leu Val Glu Arg Leu Ser Asp Ala Val Arg Asn Gly 
275 280 285 

Tyr Pro Val Leu Ala Val Leu Lys Gly Ser Ala Val Asn Gin Asp Gly 
290 295 ' 300 

Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin Arg Val 
305 310 315 320 
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lie Arg Gin Ala Leu Thr Gly Ala Gly Leu Ala Ala Ser Asp lie Asp 
325 330 335 

Ala Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Val Glu 
340 345 350 

Ala His Ala Leu Leu Ala Thr Tyr Gly Gin Gin Arg Ala Ala Asp Arg 
355 360 365 

Pro Cys Gly Leu Gly Ser Met Lys Ser Asn lie Gly His Thr Gin Ala 
370 375 380 

Ala Ala Gly lie Ala Gly Val Met Lys Met Val Leu Ala Met Arg His 
385 390 395 400 

Gly His Leu Pro Arg Thr Leu His Leu Asp Glu Pro Thr Gly His Val 
405 410 415 

Asp Trp Ser Glu Gly Asn Ala Arg Leu Leu Ala Glu Pro Glu Pro Trp 
420 425 430 

Pro Ser Ala Gly Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly lie 

435 440 445 

Ser Gly Thr Asn Ala His Val lie Leu Glu Gin Ala Pro Ala His Glu 
450 455 460 

Ala Glu Pro Ala Pro Glu Pro Ala Ala Arg Pro Gly Ala Leu Pro Trp 
465 470 475 480 

lie Leu Ser Ala Arg Thr Glu Ala Gly Leu Arg Ala Gin Ala Asp Arg 
485 490 495 

Leu Gly Arg His Leu Arg Asp Arg Ala Asp Leu Glu Pro Ala Ala Val 
500 505 510 

Ala His Ala Leu Ala Asp Thr Arg Thr Leu Met Glu His Arg Ala Val 
515 520 525 

Val Val Ala Gly Asp Arg Glu Glu Phe Leu Arg Gly Leu Asp Ala Leu 
530 535 540 

Ala Ala Gly Arg Thr Ala Asn Gly Leu Val Ser Gly Val Ala Val Lys 
545 550 555 560 

Ala Ala Ser Ala Phe Leu Phe Ala Gly Gin Gly Ser Gin Arg Pro Gly 
565 570 575 

Met Gly Arg Glu Leu His Ala Ala His Pro Val Phe Ala Thr Ala Phe 
580 585 590 

Asp Ala Val Cys Ala Glu Leu Asp Pro His Leu Asp Arg Pro Leu Arg 
595 600 605 

Asp lie Val Phe Ala Glu Glu Gly Ser Ala Glu Ala Ala Leu Leu Asp 
610 615 620 

Gin Thr Ala Tyr Thr Gin Ala Ala Leu Phe Ala Leu Glu Thr Ala Leu 
625 630 635 640 

Phe Arg Leu Val Glu Ser Trp Gly Val Ala Pro Arg Phe Val Ala Gly 
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645 650 655 

His Ser lie Gly Glu Leu Thr Ala Ala His Val Ser Gly Val Leu Thr 
660 665 670 

Leu His Asp Ala Ala Arg Leu Val Ala Ala Arg Gly Thr Leu Met Gin 
675 680 685 

Ala Leu Pro Ala Gly Gly Ala Met Val Ala Val Gin Ala Thr Glu Asp 
690 695 700 

Glu lie Arg Glu Arg Leu Ala Gly His Glu Asp His Val Ala Leu Ala 
705 710 715 720 

Ala Ala Asn Gly Pro Asp Ser Thr Val lie Ser Gly Asp Glu Gin Ala 
725 730 735 

Val Thr Glu lie Ala Ala His Trp Glu Ala Gin Gly Arg Arg Thr Lys 
740 745 750 

Arg Leu Arg Val Ser His Ala Phe His Ser Pro His Met Asp Asp Met 
755 760 765 

Leu Glu Asp Phe Arg Arg Val Ala Arg Gly Leu Thr Phe His Ala Pro 
770 775 780 

Arg lie Pro Val Val Ser Thr Val Thr Gly Ala Leu Ala Thr Glu Asp 
785 790 795 800 

Glu Leu Arg Ser Pro Asp Tyr Trp Val Arg Gin Val Arg Glu Thr Val 
605 810 815 

Arg Phe Cys Ala Ala Val Arg Thr Leu Glu Ala Glu Gly Val Thr Thr 
820 825 830 

Phe Val Glu He Gly Thr Gly Gly Val Leu Thr Pro Met Val Gin Asp 
835 840 845 

Cys Leu Thr Thr Leu Glu Glu Pro Val Leu Val Pro Leu Leu Arg Thr 
850 855 860 

Gly Arg Pro Glu Thr Val Ala Leu Thr Glu Gly Val Ala Thr Ala Phe 
865 870 875 880 

Val His Gly Val Pro Val Asp Arg Ser Ala Phe Pro Gly Ala Pro Gly 
885 890 895 

Thr Ser Arg Ala Asp Leu Pro Thr Tyr Ala Phe Gin Arg Gin Trp Tyr 
900 905 910 

Trp Leu Asp Pro Ala Asp His Asp Glu Gly Glu Ala Ala Ala Ala Glu 
915 920 925 

Ala Gly Glu Ala Gly Phe Trp Ala Ala Val Glu Arg Glu Asp Leu Gin 
930 935 940 

Glu Leu Ser Ala Val Leu Ala He Asp Gly Ser Glu Ala Asp Ser Leu 
945 950 955 960 

Gly Ser Phe Leu Pro Thr Leu Ser Ser Trp Arg Arg Gin Arg Arg Thr 
965 970 975 
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Gin Ala Ala Ala Asp Arg Phe Ser Tyr Arg Thr His Trp Ala Pro Arg 
980 ' 985 990 

Thr Ala Ser Gly Gly Pro Thr Ala Thr Gly His Trp Leu Val Val Leu 
995 1000 1005 

Pro Glu Gly Gly Thr Asp Asp Pro Trp Thr Ala Arg Leu Leu Asp Ala 
1010 1015 1020 

Leu Asn Asp Gin Gly Leu His Thr Asp Val Arg Glu Leu Pro Ala Asp 
025 1030 1035 1040 

His Glu Pro Asp Ala Trp Gly Arg His Pro Val Asp Gly Val Leu Cys 
1045 1050 1055 

Leu Leu Ala Leu Asp Glu Arg Pro Thr Arg Ser Cys Pro Pro Tyr Arg 
1060 1065 1070 

Arg Gly Leu Ala Ala Thr Thr Asn Ala Ala Ala Arg Pro Glu Gly Ala 
1075 1080 1085 

Gly lie Gin Ala Pro Leu Trp Cys Val Thr Arg Gly Ala Val Ala Val 
1090 1095 1100 

Asp Arg His Glu Ala Leu Lys Ser Pro Leu Gin Ala Gin Thr Trp Gly 
105 1110 1115 1120 

Leu Gly Arg Val Ala Ala Leu Glu Ser Pro Gin Ser Trp Gly Gly Leu 
1125 1130 1135 

lie Asp Leu Pro Asp Asn Leu Asp Gly Arg Ala Val Ser Ala Leu Leu 
1140 1145 1150 

Ser Thr Leu Ala Gly Glu Glu Asp Gin Val Ala Val Arg Pro Ala Gly 
1155 1160 1165 

Val Phe Ala Arg Arg Leu Glu Arg lie Thr Pro Gly Gly Asp Thr Gly 
1170 1175 1180 

Asp Arg Trp Ser Thr His Gly Thr Val Leu Val Thr Gly Gly Thr Gly 
185 1190 1195 1200 

Ala Leu Gly Ala His Leu Ala His Trp Leu Ala Asp Ala Gly Ala Glu 
1205 1210 1215 

His Leu Val Leu Thr Gly Arg Arg Gly Pro Gin Ala Pro Gly Ala Pro 
1220 1225 1230 

Glu Leu Ala Ala Ala Leu Thr Asp Arg Gly Val Lys Val Thr Leu Ala 
1235 1240 1245 

Ala Cys Asp Ala Ala Asp Arg Asp Ala Leu Ala Ala Val Leu Ala Asp 
1250 1255 1260 

lie Pro Pro His Leu Pro Leu Thr Gly Val Val His Ala Ala Gly Val 
265 1270 1275 1280 

Leu Asp Asp Gly Val Leu Asp Ala Leu Thr Pro Glu Arg Phe Glu Thr 
1285 1290 1295 



46 



wo 00/77222 PCT/EPOO/06227 

Val Leu Arg Pro Lys Ala Arg Ala Ala Gin Asn Leu Kis Glu Leu Thr 
1300 1305 1310 

Gin Asp Leu Asp Leu Asp His Phe Val Leu Phe Ser Ser lie Val Gly 
1315 1320 1325 

Val Leu Gly Asn Ala Gly Gin Ala Asn Tyr Ala Ala Ala Asn Ala Tyr 
1330 1335 " 1340 

Leu Asp Ala Leu Ala Glu His Arg Leu Ala Gin Gly Leu Pro Ala Thr 
345 1350 1355 1360 

Ser Val Ser Trp Gly Pro Gly Gin Ala Ala Ala Trp His Asp Ser Asp 
1365 1370 1375 

Ala Ala Asp Arg Met Ser Arg Asp Gly Leu Leu Pro Met Ala Ala Ala 
1380 1385 1390 

Pro Arg Arg Arg Pro Ala Pro Ala Leu Ala Gin Gly Met Thr Gin Val 
1395 1400 1405 

Thr Val Ala Asp lie Asp Trp Ser Ala Tyr Ala Pro Ala Leu Thr Ala 
1410 1415 1420 

Val Arg Pro Ser Pro Leu. lie Gly Asp Leu Pro Glu Ala Arg Arg Ala 
425 1430 1435 1440 

Leu Gly Pro Ala Glu Gly Pro Arg Arg Glu Arg Ser Pro Leu Arg Asp 
1445 1450 1455 

Arg lie Gly Ala Leu Pro Pro Ala Glu Gin Glu Lys Ala Phe Leu Thr 
1460 1465 1470 

Met Val Arg Glu Glu Ala Ala Arg Val Leu Gly His Pro Ser Pro Asp 
1475 1480 1485 

Thr Val Asp Ala Gin Arg Ala Phe Arg Glu Gin Gly Phe Asp Ser Leu 
1490 1495 1500 

Met Ala Val Asp Leu Arg Asn Arg Leu Ser Ala Ala Thr Gly Leu Arg 
505 1510 1515 1520 

Leu Pro Ala Thr Leu Leu Phe Asp His Pro Thr Pro Leu Ala Ala Ala 
1525 1530 1535 

Ala Cys Leu Arg Ser Glu Val Leu Gly Ala Ala Gly Pro Ala Thr Val 
1540 1545 1550 

Val Gin Ala Ser Thr Ala Ala Leu Asp Glu Pro Val Ala He He Gly 
1555 1560 1565 

Met Ala Cys Arg Phe Pro Gly Gly Val His Ser Pro Glu Ala Leu Trp 
1570 1575 1580 

Arg Leu Leu Ala Glu Gly Gly Asp Ala He Thr Pro Met Pro Ala Asp 
585 1590 1595 1600 

Arg Gly Trp Asp Leu Asp Arg Leu Tyr His Pro Asp Pro Asp His Gin 
1605 1610 1615 

Gly Thr Ser Tyr Ala Arg Gly Gly Gly Phe Leu Asp Gly Ala Ala Asp 
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1620 1625 1630 

Phe Asp Ala Asp Phe Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met 
1635 1640 1645 

Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Trp Glu Val Leu Glu Gin 
1650 1655 1660 

Ala Gly lie Asp Pro Glu Ser Leu Arg Gly Ser Ser Thr Gly Val Phe 
665 1670 1675 1680 

Ala Gly Thr Asn Thr Gin Asp Tyr Gly Thr Ala Leu Asp Ala Ala Gin 
1685 1690 1695 

Asp Glu Ala Gly Gly His Arg Leu Thr Gly Asn Ala Met Ser Val Val 
1700 1705 1710 

Ser Gly Arg Val Ser Tyr Thr Phe Gly Phe Glu Gly Pro Ala Leu Thr 
1715 1720 1725 

Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Met Ala Ala 
1730 1735 1740 

Gin Ala Leu Arg Gin Gly Glu Cys Ser Leu Ala Val Ala Gly Gly Val 
745 1750 1755 1760 

Thr Val Met Ala Thr Pro Ser Ser Phe Val Glu Phe Ala Arg Gin Arg 
1765 1770 1775 

Gly Leu Ala Pro Asp Gly Arg Cys Lys Pro Phe Ala Ala Ala Ala Asp 
1780 1785 1790 

Gly Thr Gly Trp Ser Glu Gly Val Gly Leu Leu Leu Val Glu Arg Leu 
1795 1800 1805 

Ser Asp Ala Arg Arg Asn Gly His Gin Val Leu Ala Val Val Arg Gly 
1810 1815 1820 

Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Ser 
825 1830 1835 1840 

Gly Pro Ser Gin Gin Arg Val lie Arg Gin Ala Leu Ala Asn Ala Arg 
1845 1850 1855 

Val Ala Ala Ser Glu Val Asp Ala Val Glu Ala His Gly Thr Gly Thr 
1860 1865 1870 

Thr Leu Gly Asp Pro lie Glu Ala Gin Ala Leu Leu Ala Thr T>'r Gly 
1875 1880 1885 

Gin Glu Arg Pro Leu Leu Leu Gly Ala Val Lys Ser Asn Leu Gly His 
1890 1895 1900 

Thr Gin Ala Ala Ala Gly Val Ala Gly Val Met Lys Met Val Leu Ala 
905 1910 1915 1920 

Met Arg His Gly Met Leu Pro Arg Thr Leu His Val Asp Glu Pro Thr 
1925 1930 1935 

Gly His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Glu His 
1940 1945 1950 
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Thr Asp Trp Pro Glu Thr Gly His Pro Arg Arg Ala Ala Val Ser Ala 
1955 1960 1965 

Phe Gly He Ser Gly Thr Asn Ala His Val Val Leu Glu Leu Pro Ala 
1970 1975 1980 

Ala Glu Gin Pro Leu Val Glu Gin Pro Ser Ala Ala Glu Pro Asp Ala 
985 1990 1995 2000 

Pro Ala Thr Ala Pro Asp Arg Thr Pro Thr Ala Ser Asp Gly Thr Ala 
2005 2010 2015 

Pro Leu Leu Leu Ser Ala Lys Ser Glu Ser Ala Leu Arg Ala Gin Ala 
2020 2025 2030 

Ala Arg Leu His Ser His Leu Glu Arg Asp Pro Ala Leu Arg Leu Thr 
2035 2040 2045 

Asp Ala Ala Tyr Thr Leu Met Thr His Arg Thr Ala Phe Ala His Arg 
2050 2055 2060 

Ala Ala Val A.rg Ala Ala Asp His Glu Ala Ala Leu Arg Ala Leu Thr 
065 2070 2075 2080 

Ala Leu Ala Ala Gly Glu Ala Asp Pro Ala Val Asp Thr Gly Thr Ala 
2085 2090 2095 

His Thr Gly Arg Asp Ala Val Leu Phe Ser Gly Gin Gly Ser Gin Arg 
2100 2105 2110 

He Gly Met Gly Arg Glu Leu Ser Gly Arg Tyr Pro Val Phe Ala Glu 
2115 2120 2125 

Ala Phe Asp Thr Val Cys Ala Ala Leu Asp Glu His Leu Asp Arg Pro 
2130 2135 2140 

Leu Arg Asp Val Val Arg Gly Glu Asp Glu Glu Leu Leu Asn Arg Thr 
145 2150 2155 2160 

Val Tyr Ala Gin Ala Gly Leu Phe Ala He Glu Val Ala Leu Phe Arg 
2165 2170 2175 

Leu Val Glu Ser Trp Gly Val Arg Pro His Tyr Val Ala Gly His Ser 
2180 2185 2190 

Val Gly Glu He Ala Ala Ala His Val Ala Gly Val Phe Ser Leu Ala 
2195 2200 2205 

Asp Ala Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gin Ala Leu 
2210 2215 2220 

Pro Ala Gly Gly Ala Met Ala Ala He Arg Ala Thr Glu Asp Glu Val 
225 2230 2235 2240 

Leu Pro His Leu Ala Asp Ser Val Ser He Ala Ala Val Asn Gly Pro 
2245 2250 2255 

Ser Ser Val Val Val Ser Gly Ala Glu His Ala Val Leu Ser He Ala 
2260 2265 2270 
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Ala His Phe Glu Gly Ala Gly Arg Lys Thr Thr Arg Leu Arg Val Ser 
2275 2280 ^ 2285 

His Ala Phe His Ser Pro Leu Met Asp Pro Met Leu Ala Asp Phe Arg 
2290 2295 2300 

Ala Val Ala Glu Gly Leu Thr Tyr Gly Glu Pro Glu Leu Ala Val Val 
305 2310 2315 2320 

Ser Asn Val Thr Gly Gin Leu Ala Thr Pro Asp Gin Leu Arg Thr Pro 
2325 2330 2335 

Glu Tyr Trp Val Thr His Val Arg Ala Ala Val Arg Phe Ala Asp Gly 
2340 2345 2350 

lie Arg Ala Leu Gly Ala Glu Gly Val Thr Arg Phe Leu Glu Leu Gly 
2355 2360 2365 

Pro Asp Gly Val Leu Ser Ala Leu Ala Arg Glu Ser Ala Pro Asp Asp 
2370 2375 2380 

Ala Val Cys Thr Pro Val Leu Arg Lys Asp Arg Ser Glu Ala Ala Thr 
385 2390 2395 2400 

Leu Leu Ala Ala Leu Thr His Leu His Val His Gly Thr Glu He Asp 
2405 2410 2415 

Trp Thr Ala Phe Leu Ala Gly Arg Asp Ala His Ala Val Asp Leu Pro 
2420 2425 2430 

Thr Tyr Ala Phe Gin His Gin Arg Phe Trp Pro Thr Pro Asp His Thr 
2435 2440 2445 

Arg Thr Gly Asp Leu Gly Ala Val Gly Leu Glu Ala Thr Gly His Pro 
2450 2455 2460 

Leu Leu Ser Ala Ala Val Glu Leu Pro Asp Gly Glu Gly Leu Leu Phe 
465 2470 2475 2480 

Thr Thr Arg Leu Ser Leu Gin Thr His Pro Trp Leu Ala Gly His Val 
2485 2490 2495 

Val Met Gly Ser Val Leu Leu Pro Gly Thr Ala Phe Ala Glu Leu Ala 
2500 2505 2510 

Leu Arg Ala Ala Asp Glu Val Gly Cys Asp Arg Val Asp Glu Leu Thr 
2515 2520 2525 

Leu Ala Ala Pro Leu Val Leu Pro Glu His Gly Gly Val Gin Leu Gin 
2530 2535 2540 

Leu Arg Val Gly Pro Ala Asp Ala Ser Gly Arg Arg Thr Leu Thr Ala 
545 2550 2555 2560 

Arg Ser Arg Ala Glu Gly Asp Gly Asp Arg Pro Trp Val Gin His Ala 
2565 2570 2575 

Thr Gly Val Leu Ala Glu Gly Glu Ser Thr Pro Glu Pro Gly Tyr Asp 
2580 2585 2590 

Phe His Thr Glu Ser Trp Pro Pro Ala Asp Ala Ala Pro Val Glu Leu 
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2595 2600 2605 

Ser Gly Leu Tyr Pro Asp Phe Ala Ala His Gly Phe Asp Tyr Gly Pro 
2610 2615 2620 

His Phe Gin Gly Leu Arg Thr Ala Trp Arg Arg Gly Asp Glu Val Phe 
625 2630 2635 2640 

Ala Glu Val Ala Leu Pro Ala Glu Ala Glu Gly Glu Ala Ser Ala Tyr 
2645 2650 2655 

Gly Leu His Pro Ala Leu Leu Asp Ala Ala Leu His Val Val Ala Phe 
2660 2665 2670 

Asn Gly Val Asp Arg Gly Val Val Pro Phe Ser Trp Glu Ser Val Ala 
2675 2680 2685 

Leu His Ala Thr Gly Ala Ser Ala Val Arg lie Arg Val Val Arg His 
2690 2695 2700 

Ser Gly Asp Thr Val Ser Val Asp Val Ala Asp Thr Thr Gly Glu Pro 
705 2710 2715 2720 

Val Ala Ser lie Gly Thr Leu Val Leu Arg Ala Val Ser Ala Asp Gin 
2725 2730 2735 

Leu Ala Gly Gly Ala Asp Pro Ala Val Arg Asp Ala Leu Phe Arg Val 
2740 2745 2750 

Gin Trp Asn Pro Val Arg Leu Pro Pro Ala Gly Ala Ala Val Thr Val 
2755 2760 2765 

Ala Thr Leu Gly Ser Leu Ala Gly Ala Pro Phe Asp Gly Tyr Pro Asp 
2770 2775 2780 

Leu Ala Ser Leu Ala Arg Ser Gly Arg Val Ala Gly Ala Val Leu Val 
785 2790 2795 2800 

Pro Val Glu Ala Gly Ala Gly Glu Val Val Ala Asp Asp Val Val Gly 
2805 2810 2815 

Ala Thr His Ala Thr Ala Ala Arg Ala Leu Asp Leu Ala Arg Ser Trp 
2820 2825 2830 

Leu Ala Asp Asp Arg Phe Ala Ala Ser Arg Leu Val Phe Val Thr Arg 
2835 2840 2845 

Gly Ala Val Ser Gly Ala Asp Leu Ala Gly Ala Ala Val Trp Gly Leu 
2850 2855 2860 

Val Arg Ser Ala Leu Ser Glu His Pro Gly Arg Phe Gly Leu Val Asp 
865 2870 2875 2880 

Leu Asp Asp T^p Ala Glu Leu Ala Leu Val Pro Arg Val Leu Ala Ser 
2885 2890 2895 

Asp Glu Pro Gin Leu Leu Val Arg Gly Gly Glu Val Leu Ala Ala Arg 
2900 2905 2910 

Leu Ala Arg Ala Gin Ser Ser His Ala Val Thr Trp Asp Pro Ser Gly 
2915 2920 2925 
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Thr Val Leu Val Thr Gly Gly Thr Gly Gly Leu Gly Arg Val Met Ala 
2930 2935 2940 

Arg His Leu Val Val Glu His Gly Val Aarg Asn Leu Leu Leu Val Ser 
945 2950 2955 2960 

Arg Arg Gly Pro Ala Ala Glu Gly Ala Glu Glu Leu Val Thr Glu Leu 
2965 2970 2975 

Arg His Ser Gly Ala Glu Val Ala Val Glu Ala Cys Asp Val Thr Asp 
2980 2985 2990 

Ala Ala Ala Val Ala Asp Leu Val Ala Arg His Arg lie Ser Ala Val 
2995 3000 3005 

Val His Thr Ala Gly Val Leu Asp Asp Gly Val Val Glu Ser Leu Thr 
3010 3015 3020 

Pro Glu Arg Leu Ser Ala Val Leu Arg Pro Lys Val Asp Ala Ala Trp 
025 3030 3035 3040 

Asn Leu His Glu Ala Thr Arg Asp Leu Asp Leu Asp Ala Phe Val Val 
3045 3050 3055 

Phe Ser Ser Val Ala Gly Thr lie Gly Ser Pro Gly Gin Ala Asn Tyr 
3060 3065 3070 

Ala Ala Gly Asn Ala Phe Leu Asp Ala Leu Ala His His Arg Arg Ala 
3075 3080 3085 

Ala Gly Leu Pro Ala Ala Ser Leu Ala Trp Gly Pro Trp Ser Arg Asp 
3090 3095 3100 

Gly Gly Met Thr Gly Thr Leu Thr Asp Val Asp Ser Ser Ala Ser Pro 
105 3110 3115 3120 

Gly Arg His Ala Arg Thr His Pro Arg Thr Gly Arg Gly Leu Phe Asp 
3125 3130 3135 

Ala Ala Leu Ala Ala Gly Asp Ala His Leu Leu Pro Val Arg Phe Asp 
3140 3145 3150 

Trp Ala Ser Leu Arg Ala Gin Gly Glu Val Pro Pro Leu Leu Arg Gly 
3155 3160 3165 

Leu lie Arg Thr Arg Ala Arg Arg Ser Ala Val Gly Gly Ser Ala Ala 
3170 3175 3180 

Ala Ala Gly Leu Val Gly Arg Leu Ser Gly Arg Gly Thr Val Glu Arg 
185 3190 3195 3200 

Arg Glu Val Leu Leu Asp Leu Val Arg Ala Gin lie Ala Val Val Leu 
3205 3210 3215 

Gly His Ala Asn Pro Glu Thr lie Glu Ser Thr Arg Val, Phe Gin Asp 
3220 3225 3230 

Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Asn 
3235 3240 3245 
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Asn Ala Thr Gly Leu Arg Leu Ser Ala Thr Ala Val Phe Asp Tyr Pro 
3250 3255 3260 

Tlir Ala Asp Ala Leu Val Asp Phe Leu Leu Asp Glu Leu Phe Gly Ala 
265 3270 3275 3280 

Gin Glu Glu Ala Glu Leu Pro Ala Pro Val Pro Ser Pro Ala Gly Ala 
3285 3290 3295 

Ala Asp Asp Pro Val Val lie Val Gly Met Ser Cys Arg Tyr Pro Gly 
3300 3305 3310 

Gly Val Gly Ser Pro Glu Asp Leu Trp Arg Leu Val Ser Glu Gly Val 
3315 3320 3325 

Asp Ala Val Ser Asp Phe Pro Thr Asp Arg Gly Trp Asp Val Glu Ser 
3330 3335 3340 

Leu Tyr Ser Pro Asp Pro Glu Ala Leu Gly Thr Ser Tyr Thr Arg Ser 
345 3350 3355 3360 

Gly Gly Phe Leu His Glu Ala Ala Glu Phe Asp Pro Asp Phe Phe Gly 
3365 3370 3375 

Met Ser Pro Arg Glu Ala Leu Ala Thr Asp Ala Gin Gin Arg Leu Leu 
3380 3385 3390 

Leu Glu Thr Thr Trp Glu Ala lie Glu Arg Thr Gly lie Asp Pro Ala 
3395 3400 3405 

Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Val Met Tyr Thr 
3410 3415 3420 

Asp Tyr Gly Asp Leu Leu Val Gly Asp Gin Phe Glu Gly Tyr Arg Ser 
425 3430 3435 3440 

Asn Gly Ser Ala Ala Ser lie Ala Ser Gly Arg Val Ser Tyr Thr Phe 
3445 3450 3455 

Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser 
3460 3465 3470 

Leu Val Ala Leu His Trp Ala Ala Gin Ser Leu Arg Ser Gly Glu Cys 
3475 3480 3485 

Ser Leu Ala Val Ala Gly Gly Val Thr Val Met Ser Thr Pro Thr Thr 
3490 3495 3500 

Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ser Ala Asp Gly Arg Cys 
505 3510 3515 3520 

Lys Ala Phe Ala Asp Ala Ala Asp Gly Val Gly Trp Gly Glu Gly Val 
3525 3530 3535 

Gly Met Leu Val Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His 
3540 3545 3550 

Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala 
3555 3560 3565 

Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ala Gin Gin Arg Val lie 
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3570 3575 3580 



Arg Gin Ala Leu Ala Ser Ala Gly Leu Ser Ala Ala Asp Val Asp Ala 
585 3590 3595 3600 

Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro lie Glu Ala 
3605 3610 3615 

Gin Ala Leu Leu Ala Thr Tyr Gly Gin Glu Arg Pro Glu Asp Arg Pro 
3620 3625 3630 

Leu Leu Leu Gly Ser Val Lys Ser Asn lie Gly His Ala Gin Ala Ala 

3635 3640 3645 

Ser Gly Val Ala Gly Val He Lys Met Val Leu Ala Met Arg His Gly 
3650 3655 3660 



Val Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Ser His Val Asp 
665 3670 3675 3680 

Trp Ser 2Q.a Gly Ala Val Glu Leu Leu Thr Ser Glu Ala Glu Trp Pro 
3685 3690 3695 



Gin Gly Glu Gly Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Val Ser 
3700 3705 3710 

Gly Thr Asn Ala His Val He Leu Glu Gin Pro Gly Pro Asp Ala Ala 
3715 3720 3725 

Asp Ala Ala Pro Asp Ala Thr Val Thr Asp Pro Gly Ala Leu Ala Trp 
3730 3735 3740 

Val Leu Ser Ala Arg Asn Glu Ala Ala Leu Arg Cys Gin Ala Ala Arg 
745 3750 3755 3760 



Leu Leu Ser Leu Val Ala Gly Ser Asp Ala Leu Cys Ala Arg Asp He 
3765 3770 3775 

Gly His Ser Leu Val Thr Gly Arg Ser Ser Phe Ala His Arg Ala Val 
3780 3785 3790 



Val Trp Gly Gin Asp Arg Asp Ala Leu Val Arg Ala Leu Ser Ala Leu 
3795 3800 3805 

Ala Val Gly Glu Ala Asp Ala Gly Leu Ala Glu Gly Ala Ser Gly Ala 
3810 3815 3820 

Gly Arg Thr Ala Phe Leu Phe Ser Gly Gin Gly Ser Gin Arg Leu Gly 
825 3830 3835 3840 

Met Gly Trp Glu Leu Tyr Ala Arg Tyr Pro Val Phe Ala Asp Ala Phe 
3845 3850 3855 

Asp Ala Val Cys Ala Ala Leu Asp Glu His Leu Glu Arg Pro Leu Arg 
3860 3865 3870 

Asp Val Val Trp Gly Glu Asp Ala Glu Leu Leu Asn Gin Thr Ala Tyr 
3875 3880 3885 

Ala Gin Ala Gly Leu Phe Ala He Glu Val Ala Leu Tyr Arg Leu Ala 
3890 3895 3900 
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Glu Ser Trp Gly Met Arg Pro Asp Phe Val Ala Gly His Ser lie Gly 
905 3910 3915 3920 

Glu Val Ala Ala Ala His Val Ser Gly Val Phe Ser Leu Pro Asp Ala 
3925 3930 3935 

Cys Ala Leu Val Ala Al.a Arg Gly Arg Leu Met Gin Gin Leu Pro Ser 
3940 3945 3950 

Gly Gly Ala Met Met Ala lie Arg Ala Thr Glu Asp Glu Val Leu Pro 
3955 3960 3965 

His Leu Ala Glu Gly Val Ser Leu Ala Ala Val Asn Gly Pro Ser Ser 
3970 3975 3980 

Val Val lie Ser Gly Ala Glu Asp Ala Val Leu Ala He Ala Ala His 
985 3990 3995 4000 

Phe Ala Gly Glu Gly Arg Lys Thr Thr Arg Leu Arg Val Ser His Ala 
4005 4010 40X5 

Phe His Ser Pro Leu Met Glu Pro Met Leu Glu Glu Phe Arg Ala Val 
4020 4025 4030 

Val Thr Arg Leu Ser Phe Gly Thr Pro Thr He Pro Val Val Ser J\sn 
4035 4040 4045 

Leu Thr Gly Arg Leu Ala Glu Pro Glu Gin Leu Ala His Ala Asp Tyr 
4050 4055 4060 

Trp Val Arg His Val Arg Glu Ala Val Arg Phe Ala Asp Gly He Gin 
065 4070 4075 4080 

Ala Leu Arg Ala Glu Gly Val Thr Arg Phe Leu Glu Leu Gly Pro Asp 
4085 4090 4095 

Gly Val Leu Ser Ala Met Ala Arg Glu Ser Ala Ser Asp Asp Ala Val 
4100 4105 4110 

Leu Ala Pro Val Leu Arg Arg Asp Arg Pro Glu Glu Thr Ala Leu Leu 
4115 4120 4125 

Gly Ala Leu Ala Gin Leu Tyr Val Arg Gly Ala His Val Asp Trp Thr 
4130 4135 4140 

Val Pro Phe Ala Gly Ser Gly Ala Arg Trp Ala Asp Leu Pro Thr Tyr 
145 4150 4155 4160 

Ala Phe Gin His Glu Arg Phe Trp Pro Ser Gly Gly Val Ala Arg Pro 
4165 4170 4175 

Gly Asp Val Arg Ser Ala Gly Leu Gly Ser Ala Gly His Pro L.eu Leu 
4180 4185 4190 

Gly Ala Ala Val Glu Leu Ala Gly Ser Gly Gly Leu Leu Phe Thr Gly 
4195 4200 4205 

Arg Leu Ser Val Ser Ser His Pro Trp Leu Ala Asp His Val Val Leu 
4210 4215 4220 
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Gly Ser Val Leu Val Pro Gly Thr Ala Leu Val Glu Leu Val Leu Arg 
225 4230 4235 4240 

Ala Ala Asp Glu Ala Gly Cys Asp Leu Leu Glu Glu Leu Thr Leu Ala 
4245 4250 4255 

Ala Pro Leu Val Leu Pro Ala Ser Gly Ala Ala Val Gin Val Gin Val 
4260 4265 4270 

Ala Val Gly Glu Pro Asp Glu Ala Gly Arg Arg Pro Val Ser Val His 
4275 4280 4285 

Ala Arg Glu Gly Glu Gly Pro Trp Thr Leu His Ala Ser Gly Ala Val 
4290 4295 4300 

Thr Ser Gly Ala Glu Val Pro Pro Phe Asp Ala Thr Val Trp Pro Pro 
305 4310 4315 4320 

Lys Gly Ala Glu Pro Val Asp Val Ala Asp Cys Tyr Asp Val Leu Ala 
4325 4330 4335 

Asp Ala Gly Leu Thr Tyr Gly Pro Ala Phe His Gly Leu Gin Ala Ala 
4340 4345 4350 

Trp Lys Leu Gly Gly Asp Val Tyr Ala Glu Ala Lys Leu Pro Glu Ser 
4355 4360 4365 

Thr Asp Gly Asp Ala Tyr Gly Leu His Pro Ala Leu Phe Asp Ala Ala 
4370 4375 4380 

Leu His Ala Ser Ala Leu Gly Gly Ala Glu Ala Gly Gly Val Pro Phe 
385 4390 4395 4400 

Ser Trp Ala Gly Val Ser Leu His Ala Thr Gly Ala Ser His Leu Arg 
4405 4410 4415 

Val Arg He Arg Glu Ala Gly Gly Ala Leu Ser Val Ala He Ala Asp 
4420 4425 4430 

Thr Ser Gly Ala Pro Val Ala Ser Val Glu Ser Leu Val He Arg Pro 
4435 4440 4445 

Leu Ser Ala Gly Gin Val Gin Ala Ala Asp Arg Asp Ala Leu Phe Lys 
4450 4455 4460 

Ala Asp Trp Val Pro Val Pro Leu Thr Asp Glu Arg Val Glu Pro Gly 
465 4470 4475 4480 

Thr Gly Pro Glu Gly Glu Pro Leu Arg Thr Tyr Ala Asp Leu Asp Ser 
4485 4490 4495 

Leu Glu Gly Ala Ala Val Pro Gly Thr Val Leu Val Ala Pro Pro Ser 
4500 4505 4510 

Gly Ala Ala Gly Thr Val Glu Ser Val His Ala Ala Thr Val Trp T^a 
4515 4520 4525 

Leu Glu Met Val Gin Ala Trp Leu Ala Asp Asp Arg Phe Ala Thr Ser 
4530 4535 4540 

Arg Leu Val Phe Val Thr Arg Gly Ala Ala Phe Gly Ala Asp Leu Ala 
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545 



4550 



4555 



4560 



Ala Ala Ala Val Arg Gly Leu Val Arg Ser Ala Gin Ser Glu Asn Pro 
4565 4570 4575 

Gly Arg Phe Gly Leu Val Asp Met Asp Gly Asp Ala Asp Thr Thr Val 
4580 4585 4590 

Pro Ala Gin Ala Leu Ala Thr Asp Glu Pro Glu Leu Leu Val Arg Gly 
4595 4600 4605 

Gly Glu Val Leu Ala Ala Arg Leu Val Arg Ala Gin Ser Ser His Thr 
4610 4615 4620 

Val Thr Trp Asp Pro Ser Gly Thr Val Leu lie Thr Gly Gly Thr Gly 
625 4630 4635 4640 

Gly Leu Gly Arg Ser Val Ala Arg His Leu Val Ser Glu His Gly Val 
4645 4650 4655 

Arg Ser Leu Leu Leu Val Ser Arg Arg Gly Pro Ala Ala Glu Gly Ala 
4660 4665 4670 

Gly Glu Leu Val Ala Glu Leu Arg Gly Ser Gly Ala Glu Val Val lie 
4675 4680 4685 

Glu Ala Cys Asp Val Thr Asp Ala Val Ala Val Ala Asp Leu Val Ala 
4690 4695 4700 

Arg His Arg He Ser Ala Val Val His Thr Ala Gly Val Leu Asp Asp 
705 4710 47X5 4720 

Gly Val Val Glu Ser Leu Thr Pro Glu Arg Leu Ala Val Val Leu Arg 
4725 4730 4735 

Pro Lys Val Asp Ala Ala Trp Asn Leu His Glu Ala Thr Arg Gly Leu 
4740 4745 4750 

Asp Leu Asp Ala Phe Val Val Phe Ser Ser Val Ala Gly Thr Phe Gly 
4755 4760 4765 

Ser Ala Gly Gin Ala Asn Tyr Ala Ala Gly Asn Ala Phe Leu Asp Ala 
4770 4775 4780 

Leu Ala Tyr His Arg Arg Ala Val Gly Leu Pro Ala Val Ser Leu Ala 
785 4790 4795 4800 

Trp Gly Pro Trp Ser Gin Asp Gly Gly Met Thr Gly Thr Leu Ser Asp 
4805 4810 4815 

Ala Asp Val Gin Arg He Ala Arg Gin Gly Met Pro Pro Leu Thr Val 
4820 4825 4830 

Glu Glu Gly Leu Ala Leu Phe Asp Ala Ala Leu Gly Ser Ala Glu Pro 
4835 4840 4845 

Met Ala Leu Pro Val Arg Leu Asp Leu Ala Ala Leu Arg^ Ala Gin Gly 
4850 4855 4860 

Glu Pro Gin Pro Leu Leu Arg Gly Leu He Arg Thr Arg" Thr Arg Arg 
865 4870 4875 4880 
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Ser Gly Ala Ala Ala Ala Ser Gly lie Ala Gin Arg Leu Ala Gly Leu 
4885 4890 4895 

Ser Thr Ala Glu Arg Arg Glu Ala Leu Leu Asp Val Val Arg Ala Gin 
4900 4905 4910 

lie Ala Thr Val Leu Gly His Ala Gly Pro Glu Thr lie Ala Pro Asp 
4915 4920 4925 

Arg Ala Phe Gin Asp Leu Gly Leu Asp Ser Leu Thr Ala lie Glu Leu 
4930 4935 4940 

Arg Asn Leu Leu Gly Lys Ala Thr Gly Leu Arg Leu Pro Ala Thr Thr 
945 4950 4955 4960 

Val Phe Asp Tyr Pro Thr Val Asp Ala Leu Ala Ala His Leu Leu Asp 
4965 4970 4975 

Glu Leu Phe Gly Ala Glu Thr Gly Thr Ala Thr Glu Thr Pro Leu Pro 
4980 4985 4990 

Val Pro Gly Leu Pro Ser Leu Ala Asp Asp Pro Val Val lie Val Gly 
4995 5000 • 5005 

Met Ser Cys Arg Phe Pro Gly Gly Val Ala Ser Pro Glu Asp Leu Trp 
5010 5015 5020 

Arg Leu Val Ala Asp Gly Val Asp Ala Val Ser Ala Phe Pro Thr Asp 
025 5030 5035 5040 

Arg Gly Trp Glu lie Asp Asp Thr Tyr Asp Pro Glu Arg Glu Gly Ala 
5045 5050 5055 

He Ala Thr Arg Ser Gly Gly Phe Leu His Asp Ala Ala Glu Phe Asp 
5060 5065 5070 

Pro Glu Phe Phe Gly Met Ser Pro Arg Glu Ala Leu Thr Thr Asp Ala 
5075 5080 5085 

Gin Gin Arg Leu Leu Leu Glu Thr Thr Trp Glu Ala Leu Glu Arg Ala 
5090 5095 5100 

Gly Met Asp Pro Ala Thr Leu Arg Gly Ser Arg Thr Gly Val Phe Ala 
105 5110 5115 5120 

Gly Val Met Tyr His Asp Tyr Ser Thr Leu Leu Ser Gly Arg Glu Phe 
5125 5130 5135 

Glu Gly Tyr Gin Gly Ser Gly Ser Ala Gly Ser Val Ala Ser Gly Arg 
5140 5145 5150 

Val Ser Tyr Thr Phe Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr 
5155 5160 5165 

Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Ala Gin Ser Leu 
5170 5175 5180 

Arg Ser Gly Glu Cys Ser Leu Ala Leu Ala Gly Gly Vai Thr Val Met 
185 5190 5195 5200 
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Ser Thr Pro Leu .Thr Phe Val Glu Phe Ser Arg Gin Gly Gly Leu Ser 
5205 5210 5215 

Ala Asp Gly Arg Cys Lys Ala Phe Ala A^p Ala Ala Asp Gly Val Gly 
5220 5225 5230 

Trp Ala Glu Gly Ala Gly lie Leu Val Leu Glu Arg Leu Ser Asp Ala 
5235 5240 5245 

Arg Arg Asn Gly His Arg lie Leu Ala Thr Val Arg Gly Ser Ala Val 
5250 5255 5260 

Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ala 
265 5270 5275 5280 

Gin Gin Arg Val lie Arg Gin Ala Leu Ala Ser Ala Gly Leu Ser Ala 
5285 5290 5295 

Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly 
5300 5305 5310 

Asp Pro lie Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin Glu Arg 
5315 5320 5325 

Pro Glu Asp Arg Pro Leu Leu Leu Gly Ser Val Lys Ser Asn lie Gly 
5330 5335 5340 

His Ala Gin Ala Ala Ser Gly Val Ala Gly Val lie Lys Met Val Leu 
345 5350 5355 5360 

Ala Met Arg His Gly Val Leu Pro Arg Thr Leu His Val Asp Glu Pro 
5365 5370 5375 

Ser Ser His Val Asp Tirp Ser Ala Gly Ala Val Glu Leu Leu Thr Ser 
5380 5385 5390 

Glu Ala Glu Trp Pro Gin Gly Glu Gly Pro Arg Arg Ala Gly Val Ser 
5395 5400 5405 

Ser Phe Gly lie Ser Gly Thr Asn Ala His Val lie Leu Glu Gin Pro 
5410 5415 5420 

Glu Pro Val Ala Ala Glu Thr Glu Ser lie Thr Pro Asp Thr Ala Pro 
425 5430 5435 5440 

Asp Ala Ala Glu Asp Glu Ala Ala Asp Ser Gly Thr Pro Val Pro Ala 
5445 5450 5455 

Leu Leu Ser Gly Arg Ser Ala Ser Ala Leu Arg Ala Gin Ala Ala Arg 
5460 5465 5470 

Leu Leu Ser Arg Leu Asp Gly Asp Pro Gly Pro Arg lie Thr Asp Val 
5475 5480 5485 

Ala Tyr Ser Leu Ala Thr Gly Arg Ser Ala Phe Pro His Arg Ala Val 
5490 5495 5500 

lie Leu Ala Ala Asn Arg Ala T^p Leu Leu His Ser Leu Ser Ala Leu 
505 5510 5515 5520 

Ala Glu Gly His Thr Glu Ala Pro Ala Val Val Ala Gin Asp Arg Ala 
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5525 5530 5535 

Arg Ser Gly Lys Leu Ala Phe Leu Phe Ser Gly Gin Gly Ser Gin Arg 
5540 554S 5550 

Leu Gly Met Gly Arg Glu Leu Tyr Gly Arg Tyr Pro Ala Phe Ala Glu 
5555 5560 5565 

Ala Leu Asp Ala Val Cys Ala Ala Leu Asp Ala His Leii Asp Arg Pro 
5570 5575 5580 

Leu Arg Asp Val lie Trp Gly Glu Asp Ala Glu Leu Leu^Asn Arg Thr 
585 5590 5595 5600 

Gly Tyr Ala Gin Thr Gly Leu Phe Ala lie Glu Val Ala Leu Phe Arg 
5605 5610 5615 

Leu Leu Glu Ser Trp Gly Val Arg Pro Asp His Leu Leu Gly His Ser 
5620 5625 &630 

lie Gly Glu lie Ala Ala Ala His Val Ala Gly Val Leu Ser Leu Pro 
5635 5640 5645 

Asp Ala Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gin Gin Leu 
5650 5655 5660 

Pro Ser Gly Gly Ala Met Met Ala lie Arg Ala Thr Glu Asp Glu Val 
665 5670 5675 5680 

Leu Pro His Leu Ala Glu Gly Val Ser Leu Ala Ala Val Asn Gly Pro 
5685 5690 5695 

Ser Ser Val Val Val Ser Gly Ala Glu Asp Glu Val Leu Ala Leu Ala 
5700 5705 5710 

Ala His Phe Glu Glu Glu Gly Arg Lys Thr Thr Arg Leu Arg Val Ser 
5715 5720 5725 

His Ala Phe His Ser Pro Leu Met Glu Pro Met Leu Ala Asp Phe Arg 
5730 5735 5740 

Ala Val Ala Asp Gly Met Thr Tyr Ala Ala Pro Arg lie Pro Val Val 
745 5750 5755 5760 

Ser Asn Val Thr Gly Arg Pro Ala Thr Ala Glu Glu Leu Cys Cys Ala 
5765 5770 5775 

Glu Tyr Trp Val Gly His Val Arg Glu Ala Val Arg Phe Ala Asp Gly 
5780 5785 5790 

Val Gly Ala Leu Arg Glu Gin Gly Val Thr Thr Phe Leu Glu Leu Gly 
5795 5800 5805 

Pro Asp Gly Ser Leu Ser Ala Leu Ala Ala Glu Ser Ala Ala Asp Asp 
5810 5815 5820 

Ser Val Leu Ala Pro Val Leu Arg Lys Asn Arg Pro Glu Ala Pro Ala 
825 5830 5835 5840 

Leu Leu Thr Ala Leu Ala Arg Leu His Ala Gin Gly Thr Pro Val Asp 
5845 5850 5855 
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Trp Ser Ala Ala Phe Ala Gly Thr Gly TVla Arg Trp Val Asp Leu Pro 
5860 5865 5870 

Thr Tyr Ala Phe Gin His Glu Arg Phe Trp Pro Ser Gly Gly Ala Ala 
5875 5880 5885 

Arg Ala Gly Asp Val Arg Ser Ala Gly Leu Gly Ser Ala Gly His Pro 
5890 5895 5900 

Leu Leu Gly Ala Ala Val Glu Leu Ala Gly Ser Gly Gly Arg Leu Leu 
905 5910 5915 5920 

Thr Gly Arg Leu Ser Leu Ser Ser His Pro Trp Leu Ala Asp His Val 
5925 5930 5935 

Val Leu Gly Ser Val Leu Val Pro Gly Thr Ala Leu Met Glu Leu Val 
5940 5945 5950 

Leu Arg Ala Ala Asp Glu Val Asp Cys Ala Ala Val Asp Glu Leu Thr 
5955 5960 5965 

Leu Ala Ala Pro Leu Val Leu Pro Ala Ser Gly Ala Ala lie Gin Val 
5970 5975 5980 

Gin Val Trp Val Gly Glu Pro Asp Glu Ala Gly Arg Arg Pro Val Ser 
985 5990 5995 6000 

Val His Ala Arg Glu Gly Glu Gly Pro Trp Thr Leu His Ala Asp Gly 
6005 6010 6015 

Ala Leu Ala Pro Ala Ala Glu Thr Val Pro Phe Asp Thr Ala lie Trp 
6020 6025 6030 

Pro Pro Gin Gly Ala Glu His Leu Asp Ala Ala Gly Cys Tyr Glu Arg 
6035 6040 6045 

Phe Ala Asp Ala Gly Phe Ala Tyr Gly Pro Val Phe Gin Gly Leu Arg 
6050 6055 6060 

Ala Ala Trp Lys Leu Gly Glu Asp lie Tyr Ala Glu Val Ala Leu Pro 
065 6070 6075 6080 

Glu Gly Thr Asp Gly Asn Ala Tyr Gly Leu His Pro Ala Leu Phe Asp 
6085 6090 6095 

Ala Ala Leu His Ala Ala Leu Leu Gly Gly Glu Gly Thr Asp Glu Ala 
6100 6105 6110 

Ala Val Pro Phe Ser Trp Asn Gly Val Thr Leu His Ala Thr Gly Ala 
6115 6120 6125 

Ser Arg Val Arg Val Arg lie Arg Pro Thr Glu Gly Gly Thr Ser lie 
6130 6135 6140 

Ala Leu Val Asp Thr Ala Gly Ala Pro Val Ala Ser Val Arg Ser Leu 
145 6150 6155 6160 

Thr Ala Arg Pro lie Thr Ala Gly Gin Leu Gin Thr Gly Asp Arg Asp 
6165 6170 6175 
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Ser Leu Phe Gin Val Asp Trp Thr Thr Leu His Leu Thr Asp Glu Arg 
6180 6185 6190 

Ala Asn Ser Leu Ala Leu Leu Gly Lys Asp Thr Glu Gly He Leu Asp 
6195 6200 6205 

Thr Leu Ser Leu Gin Pro His Ala Asp Leu Asp Asp Leu Ala Ala Thr 
6210 . 6215 6220 

Gly Val His Asp Thr Val Leu Ala Pro Leu Pro Thr Arg Thr Ala Gly 
225 6230 6235 6240 

Thr Val Glu Ser Val His Ala Ala Thr Thr Gly Ala Leu Ala Leu He 
6245 6250 6255 

Arg Ser Trp Leu Ala Asp Asp Arg Phe Ala Ala Ser Arg Leu Val Phe 
6260 6265 6270 

Val Thr Arg Gly Ala Val Ser Gly Thr Asp Leu Ala ciy Ala Ser Val 
6275 6280 6285 

Trp Gly Leu Val Arg Ser Ala Leu Leu Glu His Pro Gly Arg Phe Gly 
6290 6295 6300 

Leu Val Asp Val Asp Val Asp Gin Asp Ala Glu Val Pro Leu Val Pro 
305 6310 6315 6320 

Arg Ala Leu Ala Ser Asp Glu Pro Gin Val Leu Val Arg Gly Gly Glu 
6325 6330 6335 

Val Leu Ala Ala Arg Leu Val Arg Ala Gin Ser Ser Asp Thr Val Thr 
6340 6345 6350 

Trp Asp Pro Ser Gly Thr Val Leu He Thr Gly Gly Thr Gly Gly Leu 
6355 6360 6365 

Gly Arg Ser Val Ala Arg His Leu Val Ser Glu His Gly Val Arg Ser 
6370 6375 6380 

Leu Leu Leu Val Ser Arg Arg Gly Pro Ala Ala Glu Gly Val Asp Ala 
385 6390 6395 6400 

Leu Val Ala Glu Leu Ala Glu Cys Gly Ala Gin Val Thr Val Glu Ala 
6405 6410 6415 

Cys Asp Val Thr Asp Ala Val Ala Val Ala Asp Leu Val Ala Arg His 
6420 6425 6430 

Arg He Ser Ala Val Val His Thr Ala Gly Val Leu Asp Asp Gly Val 
6435 6440 6445 

Val Glu Ser Leu Thr Pro Glu Arg Leu Ser Ala Val Leu Arg Pro Lys 
6450 6455 6460 

Val Asp Ala Ala Trp Asn Leu His Glu Ala Thr Arg Gly Leu Asp Leu 
465 6470 6475 6480 

Asp Ala Phe Val Val Phe Ser Ser Val Pila. Gly Thr Phe Gly Ser Ala 
6485 6490 6495 

Gly Gin Ala Asn Tyr Ala Ala Gly Asn Ala Phe Leu Asp Ala Leu Ala 
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6500 



6505 



6510 



Tyr His Arg Arg Ala Val Gly Leu Pro Ala Val Ser Leu Ala Trp Gly 
6515 6520 6525 

Pro Trp Ser Gin Asp Gly Gly Met Thr Gly Thr Leu Ser Asp Ala Asp 
6530 6535 6540 

Val Gin Arg He Ala Arg Gin Gly Met Pro Pro Leu Thr Val Glu Glu 
545 6550 6555 6560 

Gly Leu Ala Leu Phe Asp Ala Ala Leu Gly Ser Ala Glu Pro Met Ala 
6565 6570 * 6575 

Leu Pro Val Arg Leu Asp Leu Ala Ala Leu Arg Ala Gin Gly Glu Pro 
6580 6585 6590 

Gin Pro Leu Leu Arg Gly Leu He Arg Thr Pro Gly Arg Arg Thr Ala 
6595 6600 6605 

Ala Ala Ala Thr Glu Gly Asp Thr Ala Ala Ala Phe Ala Gly Arg Leu 
6610 6615 6620 

Thr Gly Leu Ser Ala Ala Glu Gly Arg Glu Val Val Leu Gly Ala Val 
625 6630 6635 6640 

Arg Ser Gin He Ala Gly Val Leu Gly His Ala Glu Ala Thr Glu He 
6645 6650 6655 

Asp Gin Asp Arg Ala Phe Leu Asp Leu Gly Phe Asp Ser Leu Thr Ala 
6660 6665 6670 

Val Glu Leu Arg Asn Arg Leu Gly Ala Val Thr Gly He Arg Leu Pro 
6675 6680 6685 

Ala Thr Leu Leu Phe Asp Tyr Pro Thr Pro Ala Glu Leu Val Ala His 
6690 6695 6700 

Leu His Ala Arg He Ala Pro Glu Pro Thr Val Gly Pro Glu Ala Leu 
705 6710 6715 6720 

Leu Gly Glu Leu Glu Arg Met Glu Lys Ser Phe Gly Gly Leu Asp Leu 
6725 6730 6735 

Thr Glu Glu Met His Glu Gin He Ala Gly Arg Leu Glu Val Leu Arg 
6740 6745 6750 

Ala Lys Trp Asp Ala Leu Axg Asp Thr Ala Ala Ala Ala Gly His Asp 
6755 6760 6765 

Gly Ser Pro Ser Asp Glu Asp Phe Asp Phe Glu Ser Ala Ser Asp Asp 
6770 6775 6780 

Glu Val Phe Asp Leu Leu Asp Asn Glu Leu Gly Leu Ser 
785 6790 6795 



<210> 5 

<211> 1650 

<212> DNA 

<213> Streptomyces natalensis 
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<220> 

<221> CDS 

<222> (X) . . (1650) 

<223> ORFl 

<400> 5 

atg ttc gag aac cag cat ctg tea egg cgc cgt ctg etc gga ctg gee 4 8 

Met Phe Glu Asn Gin His Leu Ser Arg Arg Arg Leu Leu Gly Leu Ala 
15 10 15 

gcc etc ggc ggc gee gca gee gee gga atg ace aeg ate ace tec gee 96 
Ala Leu Gly Gly Ala Ala Ala Ala Gly Met Thr Thr lie Thx Ser Ala 
20 25 30 

cct cat gcc gcg gcc gcc gac egg cgc agt ccg cag gcc cgc age ggc 144 
Pro His Ala Ala Ala Ala Asp Arg Arg Ser Pro Gin Ala Arg Ser Gly 
35 40 45 

teg ttc gta ccg gcc gtg gtg ate ggt aeg gga tac ggc gee gcg gte 192 
Ser Phe Val Pro Ala Val Val lie Gly Thr Gly Tyr Gly Ala Ala Val 
50 55 60 

tec gcg ctg egg etc ggc gag gee gga att eee aeg etc atg etc gaa 240 
Ser Ala Leu Arg Leu Gly Glu Ala Gly lie Pro Thr Leu Met Leu Glu 
65 70 75 80 

atg gge eag ctg tgg aac aag ccc gcc gac gac ggc aac gtc ttc tgc 288 
Met Gly Gin Leu Trp Asn Lys Pro Ala Asp Asp Gly Asn *yal Phe Cys 
85 90 95 

gga atg etc teg ccc gac cgc cgc tec age tgg ttc aag tee cgc ace 336 
Gly Met Leu Ser Pro Asp Arg Arg Ser Ser Trp Phe Lys Ser Arg Thr 
100 105 110 

gag gee ecg etc gge teg tte ctg tgg ctg gat gtg ate aac cgc gac 384 
Glu Ala Pro Leu Gly Ser Phe Leu Trp Leu Asp Val lie -Asn Arg Asp 
115 120 125 

ate gac ccg tac gcg gga gtg ctg gac aag gtg cac tte gac eag atg 432 
lie Asp Pro Tyr Ala Gly Val Leu Asp Lys Val His Phe Asp Gin Met 
130 135 140 

teg gtg tac gtg ggg egg ggt gte gge gge gge teg ctg gtc aac ggc 480 
Ser Val Tyr Val Gly Arg Gly Val Gly Gly Gly Ser lieu Val Asn Gly 
145 150 155 160 

ggg atg gcc gte gta ccg aag cgc teg tac ttc gag gag gte etc ccg 528 
Gly Met Ala Val Val Pro Lys Arg Ser Tyr Phe Glu Glu Val Leu Pro 
165 170 175 

egg gtg gac gcc gee gag atg tac gac egg tac ttc ccg cgc gee aac 576 
Arg Val Asp Ala Ala Glu Met Tyr Asp Arg Tyr Phe Pro Arg Ala Asn 
180 185 190 

tec atg etc aag gtg aac cac ate gac aag ggg tgg ttc gag gag aeg 624 
Ser Met Leu Lys Val Asn His lie Asp Lys Gly Trp Phe Glu Glu Thr 
195 200 205 

gag tgg tac aag ttc gcg egg gtc teg cgc gag eag gcg ggc aag gcg 672 
Glu Trp Tyr Lys Phe Ala Arg Val Ser Arg Glu Gin Ala Gly Lys Ala 
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210 215 220 

ggc ctg ggc acc acc ttc gtc ccc aac gtc tac gac ttc gac tac atg 72 0 
Gly Leu Gly Thr Thr Phe Val Pro Asn Val Tyr Asp Phe Asp Tyr Met 
225 230 235 240 

egg cgc gag gcg aac ggt gag teg ccc aag tec gcg ctg gcg acc gag 768 
Arg Arg Glu Ala Asn Gly Glu Ser Pro Lys Ser Ala Leu Ala Thr Glu 
245 250 255 

gtc ate tac ggc aac aac cac ggc aaa cag age ctg gac aag acc tac 816 
Val lie Tyr Gly Asn Asn His Gly Lys Gin Ser Leu Asp Lys Thr Tyr 
260 265 270 

ctg gee gee gcg etc ggc acc ggc aag gtc acc ate gag acc ctg cac 864 
Leu Ala Ala Ala Leu Gly Thr Gly Lys Val Thr He Glu Thr Leu His 
275 280 285 

cag gtc agg gcg ate cac cag cag ccg gac ggc age tac gtg ctg tec 912 
Gin Val Arg Ala He His Gin Gin Pro Asp Gly Ser Tyr Val Leu Ser 
290 295 300 

gtg gac cag ate gac acg gee ggc cag acc gtc gee cac aag gag ate 960 
Val Asp Gin He Asp Thr Ala Gly Gin Thr Val Ala His; Lys Glu He 
305 310 315 320 

tec tgc cgt cac ctg ttc etc ggc gee ggc age etc ggc, tec acc gaa 1008 
Ser Cys Arg His Leu Phe Leu Gly Ala Gly Ser Leu Gly-. Ser Thr Glu 
325 330 335 

ctg ctg gtg cge gee egg gac ace ggc gcg ctg ccc gac etc aac gee 1056 
Leu Leu Val Arg Ala Arg Asp Thr Gly Ala Leu Pro Asp Leu Asn Ala 
340 345 350 

gag gtc ggc gcg ggc tgg ggc ccc aac ggc aac ate atg acc ggc egg 1104 
Glu Val Gly Ala Gly Trp Gly Pro Asn Gly Asn He Met Thr Gly Arg 
355 360 365 

gee aac cac gtc tgg aac ccc acc ggg gee cac cag tec teg ate ccc 1152 
Ala Asn His Val Trp Asn Pro Thr Gly Ala His Gin Ser Ser He Pro 
370 375 380 

get ctg ggc ate gac gac tgg aac aac ccc acc gee ccg gtc ttc gee 1200 
Ala Leu Gly He Asp Asp Trp Asn Asn Pro Thr Ala Pro Val Phe Ala 
385 390 395 400 

gaa ate gee ccg atg ccc gee ggg ttg gag acc tgg gtc age etc tat 1248 
Glu He Ala Pro Met Pro Ala Gly Leu Glu Thr Trp Val Ser Leu Tyr 
405 410 415 

Ctg gcg ate acc aag aac ccc gag cgc ggc acc ttc gtc tac gac aag 1296 
Leu Ala He Thr Lys Asn Pro Glu Arg Gly Thr Phe Val Tyr Asp Lys 
420 425 430 

gee acc gac egg gee gcg ctg cgc tgg acg egg gac cag aac acg ccc 1344 
Ala Thr Asp Arg Ala Ala Leu Arg Trp Thr Arg Asp Gin Asn Thr Pro 
435 440 445 

gcg gtc aac gee gee agg teg etc ttc gac cgc ate aac aag gee aac 13 92 
Ala Val Asn Ala Ala Arg Ser Leu Phe Asp Arg He Asn Lys Ala Asn 
450 455 460 
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ggc acg atg tac cgc tac gac ctg ttc ggg ccg cag ctg aag aac ttc 1440 
Gly Thr Met Tyr Arg Tyr Asp Leu Phe Gly Pro Gin Leu Lys Asn Phe 
465 470 475 480 

tec gac gac ttc tgc tac cac ccg etc ggc ggc tgc gtc ctg ggc aag 1488 
Ser Asp Asp Phe Cys Tyr His Pro Leu Gly Gly Cys Val Leu Gly Lys 
485 490 495 

gcc acc gac ggg tac ggc egg gtc gee ggc tac cac aac etc tac gtc 1536 
Ala Thr Asp Gly Tyr Gly Arg Val Ala Gly Tyr His Asn Leu Tyr Val 
500 505 510 

acg gac ggc gcg etc ate ccg ggg tee ate ggg gtc aac ece ttc gtg 1584 
Thr Asp Gly Ala Leu lie Pro Gly Ser lie Gly Val Asn Pro Phe Val 
515 520 525 

acc ate acg gcg ctg gee gag egg aac ate gag egg ate ate gcg gag 1632 
Thr lie Thr Ala Leu Ala Glu Arg Asn lie Glu Arg lie lie Ala Glu 
530 535 540 

gac gtc aag gee gcc tag 1650 
Asp Val Lys Ala Ala 
545 550 



<210> 6 
<211> 550 
<212> PRT 

<213> Streptomyces natalensis 
<400> 6 

Met Phe Glu Asn Gin His Leu Ser Arg T^g Arg Leu Leu Gly Leu Ala 
15 10 15 

Ala Leu Gly Gly Ala Ala Ala Ala Gly Met Thr Thr lie Thr Ser Ala 
20 25 30 

Pro His Ala Ala Ala Ala Asp Arg Arg Ser Pro Gin Ala Arg Ser Gly 
35 40 45 

Ser Phe Val Pro Ala Val Val lie Gly Thr Gly Tyr Gly Ala Ala Val 
50 55 60 

Ser Ala Leu Arg Leu Gly Glu Ala Gly lie Pro Thr Leu Met Leu Glu 
65 70 75 80 

Met Gly Gin Leu Trp Asn Lys Pro Ala Asp Asp Gly Asn Val Phe Cys 
85 90 95 

Gly Met Leu Ser Pro Asp Arg Arg Ser Ser Trp Phe Lys Ser Arg Thr 
100 105 110 

Glu Ala Pro Leu Gly Ser Phe Leu Trp Leu Asp Val lie Asn Arg Asp 
115 120 125 

lie Asp Pro Tyr Ala Gly Val Leu Asp Lys Val His Phe Asp Gin Met 
130 135 140 

Ser Val Tyr Val Gly Arg Gly Val Gly Gly Gly Ser Leu Val Asn Gly 
145 150 155 160 
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Gly Met Ala Val Val Pro Lys Arg Ser Tyr Phe Glu Glu Val Leu Pro 
165 170 175 

Arg Val Asp Ala Ala Glu Met Tyr Asp Arg Tyr Phe Pro Arg Ala Asn 
180 185 190 

Ser Met Leu Lys Val Asn His lie Asp Lys Gly Trp Phe Glu Glu Thr 
195 200 205 

Glu Trp Tyr Lys Phe Ala Arg Val Ser Arg Glu Gin Ala Gly Lys Ala 
210 215 220 

Gly Leu Gly Thr Thr Phe Val Pro Asn Val Tyr Asp Phe Asp Tyr Met 
225 230 235 240 

Arg Arg Glu Ala Asn Gly Glu Ser Pro Lys Ser Ala Leu Ala Thr Glu 
245 250 255 

Val lie Tyr Gly Asn Asn His Gly Lys Gin Ser Leu Asp Lys Thr Tyr 
260 265 270 

Leu Ala Ala Ala Leu Gly Thr Gly Lys Val Thr lie Glu Thr Leu His 
275 280 285 

Gin Val Arg Ala lie His Gin Gin Pro Asp Gly Ser Tyr Val Leu Ser 
290 295 300 

Val Asp Gin He Asp Thr Ala Gly Gin Thr Val Ala His Lys Glu He 
305 310 315 320 

Ser Cys Arg His Leu Phe Leu Gly Ala Gly Ser Leu Gly Ser Thr Glu 
325 330 335 

Leu Leu Val Arg Ala Arg Asp Thr Gly Ala Leu Pro Asp Leu Asn Ala 
340 345 350 

Glu Val Gly Ala Gly Trp Gly Pro Asn Gly Asn He Met . Thr Gly Arg 
355 360 365 

Ala Asn His Val Trp Asn Pro Thr Gly Ala His Gin Ser . Ser He Pro 
370 375 380 

Ala Leu Gly He Asp Asp Tarp Asn Asn Pro Thr Ala Pro. Val Phe Ala 
385 390 395 400 

Glu He Ala Pro Met Pro Ala Gly Leu Glu Thr Trp Val Ser Leu Tyr 
405 410 415 

Leu Ala He Thr Lys Asn Pro Glu Arg Gly Thr Phe Val Tyr Asp Lys 
420 425 430 

Ala Thr Asp Arg Ala Ala Leu Arg Trp Thr Arg Asp Gin Asn Thr Pro 
435 440 445 

Ala Val Asn Ala Ala Arg Ser Leu Phe Asp Arg He Asn Lys Ala Asn 
450 455 460 

Gly Thr Met Tyr Arg Tyr Asp Leu Phe Gly Pro Gin Leu Lys Asn Phe 
465 470 475 480 
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Ser Asp Asp Phe Cys Tyr His Pro Leu 
485 

Ala Thr Asp Gly Tyr Gly Arg Val Ala 
500 505 

Thr Asp Gly Ala Leu lie Pro Gly Ser 
515 520 

Thr lie Thr Ala Leu Ala Glu Arg Asn 
530 535 

Asp Val Lys Ala Ala 
545 550 




PCT/EPOO/06227 



Gly Gly Cys Val Leu Gly Lys 
490 495 

Gly Tyr His Asn Leu Tyr Val 
510 

lie Gly Val Asn Pro Phe Val 
525 

He Glu Arg He He Ala Glu 
540 



<210> 7 
<211> 1197 
<212> DNA 

<213> Streptomyces natalensis 

<220> 

<221> CDS 

<222> (1) . . (1197) 

<223> ORF2 

<400> 7 

atg acg tac aca gac ccg gcc gcg ccc gag acg gat ccg cog gcc otc 48 

Met Thr Tyr Thr Asp Pro Ala Ala Pro Glu Thr Asp Pro Pro Ala Val 
15 10 15 

gac ttt ccg cag cgc aag ccc ggc gtg ccg ttc ccg ccg ccc gac tac 96 
Asp Phe Pro Gin Arg Lys Pro Gly Val Pro Phe Pro Pro Pro Asp Tyr 



20 25 



30 



gcc gac tac cgc gac egg aag ggg etc gtc etc teg cag ctg tec gac 144 
Ala Asp Tyr Arg Asp Arg Lys Gly Leu Val Leu Ser Gin Leu Ser Asp 
35 40 



45 



ggc aaa egg gta tgg ctg gtc acc egg eac gag gac gta cgc gcc gta 192 
Gly Lys Arg Val Trp Leu Val Thr Arg His Glu Asp Val Arg Ala Val 
50 55 60 



etg acc age ccg age ate age teg aac ccc gag cac aag gga ttt ccc 
Leu Thr Ser Pro Ser He Ser Ser Asn Pro Glu His Lys Gly Phe Pro 
65 70 



240 



75 80 



aac gtc ggg aac ctg ggt gtg ccc aag cag gac cag ate ccg ggc tgg 288 
Asn val Gly Asn Leu Gly Val Pro Lys Gin Asp Gin He Pro Gly tS 
85 90 95 

If^ H*"? ^f"" ^^"^ csrS ttc cgc aag gcc etc 336 

Phe Val Gly Met Asp Ser Pro Glu His Asp Arg Phe Arg Lys Ala Leu 
100 105 110 

T^""" ^l"" ^""^ "=^3 ^'^9 cc«= 9cg ate 384 

He Pro Glu Phe Thr Val Arg Arg Val Arg Ala Met Lys Pro Ila He 
115 120 125 

gaa cgc acg gtg gac gcc caa etg gac gcc atg ctg gcc gcg ggc aac 432 
Glu Arg Thr Val Asp Ala Gin Leu Asp Ala Met Le2 La 111 lly tin 
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130 135 140 

acc gcc gac etc gtc gcc gac ttc gcc ctg ccc ate cee tec ctg gtg 480 
Thr Ala Asp Leu Val Ala Asp Phe Ala Leu Pro He Pro Ser Leu Val 
145 150 155 160 

ate tec gca etg etc ggc gtg ccg ccc gcc gac cgc gag ttc ttc gag 52 8 
He Ser Ala Leu Leu Gly Val Pro Pro Ala Asp Arg Glu Phe Phe Glu 
165 170 175 

tec agg acc cgc gtc ctg gtc tec etc cgc tec tec acc gac gac gac 576 
Ser Arg Thr Arg Val Leu Val Ser Leu Arg Ser Ser Thr Asp Asp Asp 
180 185 190 

egg atg gcc gcc gcc aag gac etc ctg egg tac ate aac egg etc gtg 624 
Arg Met Ala Ala Ala Lys Asp Leu Leu Arg Tyr He Asn Arg Leu Val 
195 200 205 

gag ate aaa cag aag tgg ggc ggc gac gac etc ate acc egg ctg ctg 672 
Glu He Lys Gin Lys Trp Gly Gly Asp Asp Leu He Thr Arg Leu Leu 
210 215 220 

gee ace ggb gcc ate gcc ccc eac gaa atg tec ggc gtg etg atg etc 720 
Ala Thr Gly Ala He Ala Pro His Glu Met Ser Gly Val Leu Met Leu 
225 230 235 240 

Ctg etc ate gee ggc eac gag acc acg gcc aac aac ate gcc etc ggc 768 
Leu Leu He Ala Gly His Glu Thr Thr Ala Asn Asn He Ala Leu Gly 
245 250 255 

gtg gtc acc ctg ctg gcg aac ccc caa tgg ate ggc gac gac egg gcc 816 
Val Val Thr Leu Leu Ala Asn Pro Gin Trp He Gly Asp Asp Arg Ala 
260 265 270 

gtg gag gag acc ctg cgc ttc eac tec gtc gcc gac etg gtg tec etg 864 
Val Glu Glu Thr Leu Arg Phe His Ser Val Ala Asp Leu Val Ser Leu 
275 280 285 

cgc gtc gcg gtc cag gac gtg gaa ate gee ggg cag etc ate aag gcg 912 
Arg Val Ala Val Gin Asp Val Glu He Ala Gly Gin Leu He Lys Ala 
290 295 300 

ggc gag gga ate gtg ccg ctg gtc gee gcc gcc aat eat gac gag aac 960 
Gly Glu Gly He Val Pro Leu Val Ala Ala Ala Asn His Asp Glu Asn 
305 310 315 320 

gcc ttc gaa tgc ccc eac gcc ttc gac ccg tec egg tee gee cgc eac 1008 
Ala Phe Glu Cys Pro His Ala Phe Asp Pro Ser Arg Ser Ala Arg His 
325 330 335 

eat gtg gcc ttc ggc tac ggc gta eac caa tgc ctg gga cag aac etg 1056 
His Val Ala Phe Gly Tyr Gly Val His Gin Cys Leu Gly Gin Asn Leu 
340 345 350 

gtg egg ate gag atg gaa gtc gcg tac egg aaa etc ttc gag cgc ate 1104 
Val Arg He Glu Met Glu Val Ala Tyr Arg Lys Leu Phe Glu Arg He 
355 360 365 

ccg aac etc gaa etc gcc gtg ccc ace gac ggg ttg gac ate aag tac 1152 
Pro Asn Leu Glu Leu Ala Val Pro Thr Asp Gly Leu Asp He Lys Tyr 
370 375 380 
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gac ggc gtg etc tac ggc ctg aac gag ctg ccc gtc cgc tgg tag 1197 
Asp Gly Val Leu Tyr Gly Leu Asn Glu Leu Pro Val Arg Trp 
385 390 395 



<210> B 
<211> 399 
<212> PRT 

<213> Streptomyces natalensis 
<400> 8 

Met Thr Tyr Thr Asp Pro Ala Ala Pro Glu Thr Asp Pro Pro Ala Val 
15 10 15 

Asp Phe Pro Gin Arg Lys Pro Gly Val Pro Phe Pro Pro Pro Asp Tyr 
20 25 30 

Ala Asp Tyr Arg Asp Arg Lys Gly Leu Val Leu Ser Gin Leu Ser Asp 
35 40 45 

Gly Lys Arg Val Trp Leu Val Thr Arg His Glu Asp Val Arg Ala Val 
50 55 60 

Leu Thr Ser Pro Ser lie Ser Ser Asn Pro Glu His Lys Gly Phe Pro 
65 70 75 80 

Asn Val Gly Asn Leu Gly Val Pro Lys Gin Asp Gin lie Pro Gly Trp 
85 90 95 

Phe Val Gly Met Asp Ser Pro Glu His Asp Arg Phe Arg Lys Ala Leu 
100 105 110 

He Pro Glu Phe Thr Val Arg Arg Val Arg Ala Met Lys Pro Ala He 
115 120 125 

Glu Arg Thr Val Asp Ala Gin Leu Asp Ala Met Leu Ala Ala Gly Asn 
130 135 140 

Thr Ala Asp Leu Val Ala Asp Phe Ala Leu Pro He Pro Ser Leu Val 
145 150 155 160 

He ser TQa Leu Leu Gly Val Pro Pro Ala Asp Arg Glu Phe Phe Glu 
165 170 175 

Ser Arg Thr Arg Val Leu Val Ser Leu Arg Ser Ser Thr Asp Asp Asp 
180 185 190 

Arg Met Ala Ala Ala Lys Asp Leu Leu Arg Tyr He Asn Airg Leu Val 
195 200 205 

Glu He Lys Gin Lys Trp Gly Gly Asp Asp Leu He Thr Arg Leu Leu 
210 215 220 

Ala Thr Gly Ala He Ala Pro His Glu Met Ser Gly Val "Leu Met Leu 
225 230 235 240 

Leu Leu He Ala Gly His Glu Thr Thr Ala Asn Asn He Ala Leu Gly 
245 250 255 

Val Val Thr Leu Leu Ala Asn Pro Gin Trp He Gly Asp Asp Arg Ala 
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260 265 270 

Val Glu Glu Thr Leu Arg Phe His Ser Val Ala Asp Leu Val Ser Leu 
275 280 285 

Arg Val Ala Val Gin Asp Val Glu lie Ala Gly Gin Leu lie Lys Ala 
290 295 300 

Gly Glu Gly lie Val Pro Leu Val Ala Ala Ala Asn His Asp Glu Asn 
305 310 315 320 

Ala Phe Glu Cys Pro His Ala Phe Asp Pro Ser Arg Ser Ala Arg His 
325 330 ■ 335 

His Val Ala Phe Gly Tyr Gly Val His Gin Cys Leu Gly Gin Asn Leu 
340 345 350 

Val Arg lie Glu Met Glu Val Ala Tyr Arg Lys Leu Phe Glu Arg lie 
355 360 365 

Pro Asn Leu Glu Leu Ala Val Pro Thr Asp Gly Leu Asp lie Lys Tyr 
370 375 380 

Asp Gly Val Leu Tyr Gly Leu Asn Glu Leu Pro Val Arg Trp 
385 390 395 

<210> 9 
<211> 1194 
<212> DNA 

<213> Streptomyces natalensis 

<220> 

<221> CDS 

<222> (1) . . (1194) 

<223> ORF3 

<400> 9 

atg acc gcc gcc tec cac gac ctg ccc tgc etc aac etc gaa ccg ccc 48 

Met Thr Ala Ala Ser His Asp Leu Pro Cys Leu Asn Leu Glu Pro Pro 

1 5 10 15 

aaa atg ctg aaa ctg age ccg ctg ctg ege gee ttg cag gac egg ggg 96 
Lys Met Leu Lys Leu Ser Pro Leu Leu Arg Ala Leu Gin Asp Arg Gly 
20 25 30 

ccg ate cac egg gtg cgc aca ccc gcc ggg gac gag gcg tgg ctg gtg 144 
Pro lie His Arg Val Arg Thr Pro Ala Gly Asp Glu Ala Trp Leu Val 
35 40 45 

ace cgc cac gcc gag etc aag cag ctg ctg cac gac gag cgc ate ggc 192 
Thr Arg His Ala Glu Leu Lys Gin Leu Leu His Asp Glu Arg lie Gly 
50 55 60 

cgc acg cac ccc gac ccg ccc tec gcc gcc cag tac gta cgc age ccc 24 0 
Arg Thr His Pro Asp Pro Pro Ser Ala Ala Gin Tyr Val Arg Ser Pro 
€5 70 75 80 

ttc ctg gac ctg ctg ate age gac gcc gac gee gag tec ggg cgt egg 288 
Phe Leu Asp Leu Leu He Ser Asp Ala Asp Ala Glu Ser Gly Arg Arg 
85 90 95 
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cag cac gcc gag acc cgc cgc ctg etc act ccg ttg ttc teg gcc egg 336 
Gin His Ala Glu Thr Arg Arg Leu Leu Thr Pro Leu Phe Ser Ala Arg 
100 105 110 

cgc gtt ctg gaa atg cag ccg aag gtg gag gag gcc gcg gac acc ctg 384 
Arg Val Leu Glu Met Gin Pro Lys Val Glu Glu Ala Ala Asp Thr Leu 
lis 120 125 

ctg gac gcg ttc ate gee cag ggg cet cec ggc gac ctg cac ggc gag 432 
Leu Asp Ala Phe lie Ala Gin Gly Pro Pro Gly Asp Leu His Gly Glu 
130 135 140 

etc ace gtg eeg ttc gee etc acg gtc etc tge gag gtc ate ggc gtg 480 
Leu Thr Val Pro Phe Ala Leu Thr Val Leu Cys Glu Val lie Gly Val 
145 150 155 160 

ccg ccg cag cgc cgc gcg gag ctg acc aca ctg ctg gcc ggt ate gcc 528 
Pro Pro Gin Arg Arg Ala Glu Leu Thr Thr Leu Leu Ala Gly lie Ala 
165 170 175 

aag ctg gac gac cgc gag ggc gcc gta egg gca cag gac gac ctg ttc 576 
Lys Leu Asp Asp Arg Glu Gly Ala Val Arg Ala Gin Asp Asp Leu Phe 
180 185 190 

ggg tac gtg gca ggg ctg gtc gag cac aag egg gcc gag cec ggc cca 624 
Gly Tyr Val Ala Gly Leu Val Glu His Lys Arg Ala Glu Pro Gly Pro 
195 200 205 

gac ate ate tec egg ctg aac gac ggc gag ctg ace gag gac cgc gtg 672 
Asp lie lie Ser Arg Leu Asn Asp Gly Glu Leu Thr Glu Asp Arg Val 
210 215 220 

gca cac ctg gcc atg ggc ctg ctg ttc gee ggg ctg gac age gtc gcg 720 
Ala His Leu Ala Met Gly Leu Leu Phe Ala Gly Leu Asp Ser Val Ala 
225 230 235 240 

age ate atg gac aac ggg gtg gtg ctg ctg gee gcc cac cec gat cag 768 
Ser lie Met Asp Asn Gly Val Val Leu Leu Ala Ala His Pro Asp Gin 
245 250 255 

cgc gcg gcg gcg ctg gee gac cec gac gtg atg gcg cgt gee gtg gag 816 
Arg Ala Ala Ala Leu Ala Asp Pro Asp Val Met Ala Arg Ala Val Glu 
260 265 270 

gag gtg ctg egg ace gcc egg gee ggc ggg teg gtc ctg ccg eeg cgc 864 
Glu Val Leu Arg Thr Ala Arg Ala Gly Gly Ser Val Leu Pro Pro Arg 
275 280 285 

tac gcc age gag gac atg gaa ttc ggc ggg gtg acg ata egg gcc gga 912 
Tyr Ala Ser Glu Asp Met Glu Phe Gly Gly Val Thr lie Arg Ala Gly 
290 295 300 

gac ctg gtc ctg ttc gac etc ggc ctg cec aac ttc gac gag egg gcg 960 
Asp Leu Val Leu Phe Asp Leu Gly Leu Pro Asn Phe Asp Glu Arg Ala 
305 310 315 320 

ttc aca ggg ccg gag gaa ttc gac gcc gcc agg ace cec aat cec eat 1008 
Phe Thr Gly Pro Glu Glu Phe Asp Ala Ala Arg Thr Pro Asn Pro His 
325 330 335 



72 



wo 00/77222 PCT/EPOO/06227 

ctg acc ttc ggc cac ggc ate tgg cac tgc ate gge gee ece cte gcg 1056 
Leu Thr Phe Gly His Gly lie Trp His Cys lie Gly Ala Pro Leu Ala 
340 345 350 

cgc ctg gaa ete agg acg atg ttc acc aag ctg ttc acc cge ctg ccg 1104 
Arg Leu Glu Leu Arg Thr Met Phe Thr Lys Leu Phe Thr Arg Leu Pro 
355 360 365 

gaa ctg cgc ccg gaa ctt ccg gtg gag caa ctg cgc ctg aag gag ggc 1152 
Glu Leu Arg Pro Glu Leu Pro Val Glu Gin Leu Arg Leu Lys Glu Gly 
370 375 380 

cag ctg teg ggc ggc ttc gcc gag etc egg gtg gtc tgg tag 1194 
Gin Leu Ser Gly Gly Phe Ala Glu Leu Arg Val Val Trp 
385 390 395 



<210> 10 
<211> 398 
<212> PRT 

<213> Streptoxnyees natalensis 
<400> 10 

Met Thr Ala Ala Ser His Asp Leu Pro Cys Leu Asn Leu Glu Pro Pro 
15 10 15 

Lys Met Leu Lys Leu Ser Pro Leu Leu Arg Ala Leu Gin Asp Arg Gly 
20 25 30 

Pro lie His Arg Val Arg Thr Pro Ala Gly Asp Glu Ala Trp Leu Val 
35 40 45 

Thr Arg His Ala Glu Leu Lys Gin Leu Leu His Asp Glu Arg lie Gly 
50 55 60 

Arg Thr His Pro Asp Pro Pro Ser Ala Ala Gin Tyr Val Arg Ser Pro 
65 70 75 80 

Phe Leu Asp Leu Leu lie Ser Asp Ala Asp Ala Glu Ser Gly Arg Arg 
85 90 95 

Gin His Ala Glu Thr Arg Arg Leu Leu Thr Pro Leu Phe Ser Ala Arg 
100 105 110 

Arg Val Leu Glu Met Gin Pro Lys Val Glu Glu Ala Ala Asp Thr Leu 
115 120 125 

Leu Asp Ala Phe lie Ala Gin Gly Pro Pro Gly Asp Leu His Gly Glu 
130 135 140 

Leu Thr Val Pro Phe Ala Leu Thr Val Leu Cys Glu Val lie Gly Val 
145 150 155 160 

Pro Pro Gin Arg Arg Ala Glu Leu Thr Thr Leu Leu Ala Gly lie Ala 
165 170 175 

Lys Leu Asp Asp Arg Glu Gly Ala Val Arg Ala Gin Asp Asp Leu Phe 
180 185 190 

Gly Tyr Val Ala Gly Leu Val Glu His Lys Arg Ala Glu Pro Gly Pro 
195 200 205 



73 



wo 00/77222 




PCT/EPOO/06227 



Asp lie lie Ser Arg Leu Asn Asp Gly Glu Leu Thr Glu Asp Arg Val 
210 215 220 

Ala His Leu Ala Met Gly Leu Leu Phe Ala Gly Leu Asp Ser Val Ala 
225 230 235 240 

Ser lie Met Asp Asn Gly Val Val Leu Leu Ala Ala His Pro Asp Gin 
245 250 255 

Arg Ala Ala Ala Leu Ala Asp Pro Asp Val Met Ala Arg Ala Val Glu 
260 265 270 

Glu Val Leu Arg Thr Ala Arg Ala Gly Gly Ser Val Leu Pro Pro Arg 
275 280 285 

Tyr Ala Ser Glu Asp Met Glu Phe Gly Gly Val Thr He Arg Ala Gly 
290 295 300 

Asp Leu Val Leu Phe Asp Leu Gly Leu Pro Asn Phe Asp Glu Arg Ala 
305 310 315 320 

Phe Thr Gly Pro Glu Glu Phe Asp Ala Ala Arg Thr Pro Asn Pro His 
325 330 335 

Leu Thr Phe Gly His Gly He Trp His Cys He Gly Ala Pro Leu Ala 
340 345 350 

Arg Leu Glu Leu Arg Thr Met Phe Thr Lys Leu Phe Thr Arg Leu Pro 
355 360 365 

Glu Leu Arg Pro Glu Leu Pro Val Glu Gin Leu Arg Leu Lys Glu Gly 
370 375 380 

Gin Leu Ser Gly Gly Phe Ala Glu Leu Arg Val Val Trp 
385 390 395 



<210> 11 

<211> 40 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Forward primer 
ermE promoter 

<400> 11 

aaactgcagc tctagaggcg gcttgcgccc gatgctagtc 4 0 



<210> 12 
<211> 70 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Reverse primer 
ermE promoter 



<400> 12 
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aaactgcagc tctagatggc ccgggtatcg atcgtcgacg gcatgcggat cctaccaacc 6 0 



<210> 13 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sec[uence: Forward primer ORFl 
<400> 13 

aggattaccc atatgttcga gaaccagcat 30 



<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sec[uence : Reverse primer ORFl 



ggcacgattg 



70 



<400> 14 

gcatgagcgt gggaattccg 



20 



?? 
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